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This Intematioriat Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 1 8. A copy is being transmitted to the International Bureau. 



. sheete. 



This International Search Report consists of a total of 5 

Pn It is also accompanied by a copy of each prior art document cited in this report. 



1 . Basis of the report 

a. With regard to the language, the international search was carried out on the basis of the international application in the 
language in which it was filed, unless otherwise Indicated under this item. 

I I the international search was carried out on the basis of a translation of the international application furnished to this 
Authority (Rule 23.1 (b)). 

b. With regard to any nucleotide and/or amino acid sequence disclosed in the Intemational application, the international search 
was carried out on the basis of the sequence listing : 

I I contained in ttie international application in written form. 

filed together with the intemational application in computer readable form. ^ . ^ 

furnished subsequentiy to this Authority in written form, 
furnished subsequently to ttiis Authority in computer readble form. 



2. 
3. 



□ 



□ 
□ 



the statement that the subsequentiy fumished written sequence listirig does not go beyond the disclosure in the 
international application as filed has been fumished. 

the statement ttiat the information recorded in computer readable form is identical to the written sequence listing has been 
furnished 

Certain claims were found unsearchable (See Box I). 
Unity of Invention is lacking (see Box II). 



4. With regard to the title, 

pn the text is approved as submitted by the applicant. 

I I the text has been established by this Authority to read as follows: 



5. With regard to the abstract, 

\T\ the text is approved as submitted by the applicant. 

I I the text has been established, according to Rule 38.2(b), by this Auttiority as it appears in Box III. The applicant may, 
' — ' within one montti from the date of mailing of this international search report, submit comments to this Authority. 

6. The figure of the drawings to be published with the abstract is Rgure No. 12 



pn as suggested by the applicant. None of the figures. 

I I because the applicant failed to suggest a figure. 

I I because this figure better characterizes the invention. 
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"A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" eartier document but published on or after the international 
filing date 

"L" document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhtoition or 
other means 

*P" document published prior to the international filing date but 
later than the priortty date claimed 



T" later documerrt published after tlie intematior^l filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underiying the 
invention 

"X" document of particular relevance; the claimed Invention 
cannot be considered novel or cannot be considered to 
involve £m inventive step when the document is taken alone 

■Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step vrtien the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person sidlled 
in the art. 
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Date of mailing of the international search report 

31/08/2000 
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1 . This imemaiional preliminary examination report has been prepared by thte IntemaUonal PrBltmlnary Examining Authority 
and is transmitted to the applicant according to Article 96. 

2. This REPORT consists of a total of 6 sheets, including this cover sheet 

□ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have 
been amended and are the baste for this report and/or sheets containing recllficatfons made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items: 



1 




M 


□ 


III 


□ 


IV 




V 


IS 


VI 


n 


Vtl 


□ 


VIII 


□ 



Lack of unity of invention 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or Industrial applicability 
citations and explanations suporting such statement 



Date of submiseion of me demand 



06/10/2000 



Name and mailing address of the international 
preliminary examining auttiority: 

European Patent Office 

^1 0-80298 Munich 

sy' Tel. +49 B9 2399 • 0 Tx: 523656 epmu d 
Fax:+49 89 2399 - 4465 



Fomi PCT/lPEA/409 (cover sheer) (January 1994) 



Date of oompletion of this report 



23.07^001 



Authorized officer 
Vollbach, S 

Telephone No. +48 B9 2TO 871 S 




19/07 01 THU 11:44 [TX/RX NO 6001) 



19. JUL. 2001 12:42 EPA MUEMCHEN +49 89 23994465 NR. 7033 S. 3/7 

INTERNATIONAL PR NARY 

EXAMINATION REPORT IntematJonaJ application No. PCT/USOO/071 07 

I. Basis of the report 

1 . With regard to the elements of the international application (Beplacement sheets which have been furnished to 
the receiving Office in response to an invitation under Arme 14 are referred to in this report as "ohginaliy filed" 
and are not annexed to this report since they do not contain amendments (Rates 70. 16 and 70. 17)): 
Description, pages: 

1-61 as originally filed 

Claims, No.: 

1 -59 as originally filed 

Drawings, No.: 

1-19 as Originally filed 

Sequence listing pari of the descriptloni pages: 
1 -56, filed with the letter of 24.7.00 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless othenvise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the intemattonal search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 4a.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 
S furnished subsequently to this Authority in written form. 

B fumished subsequently to this Authority in computer readable form. 

H TTie statement that the subsequently fumished written sequence listing does not go beyond the disclosure in 
the international application as filed has been fumished. 

H The statement that the infomiation recorded in computer readable fonn is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 
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INTERNATIONAL PRE^ShsiARY ^ 

EXAMINATION REPORT International application No. PCT/USOO/07107 

□ the description* pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

5- □ This report has been established as if (some of) the amendments had not been made, since they have been 
considered to go beyond the disclosure as filed (Rule 70.2(c)}: 

(Any replacement sheet containing such amenaments must t>9 referred to under item 1 and annexed to this 
report.) 

6. Additional obsen/ations, if necessary: 



IV. Lack of unity of invention 

1 . In response to the invitation to restrict or pay additional fees the applicant has: 

□ restricted the claims. 

□ paid additional fees. 

□ paid additional fees under protest. 

El neither restricted nor paid additional fees. 

2. □ This Authority found that the requirement of unity of invention is not complied and chose, acoording to Rule 

68.1 , not to invite the applicant to restrict or pay additional fees. 

3. This Authority considers that the requirement of unity of invention in accordance with Rules 13.1, 13.2 and 13.3 is 

□ complied with. 

□ not complied with for the following reasons: 

4. Consequently, the following parts of the ffitemational application were the subject of international preliminary 
examination in establishing this report 

□ alf parts. 

El the parts relating to claims Nos. 3^1 ,37 all partially. 

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrlaf appficabilrty; 
citations and explanations supporting such atatement 

1. Statement 

Novelty (N) Yes: Claims 

No: Claims 3.21,37 
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NARY 



International application No, PCT/USOO/071 07 



Inventive step (IS) 



Yes: Claims 



No: Claims 3^1,37 

Industrial applicability (lA) Yes: Claims 3,21.37 

No: Claims 



2. Citations and explanations 
sea separate sheet 
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Re Item IV 

Lack of unity of invention 

The filing of an amended set of claims in response to an invitation to pay additional fees 
or to restrict the application to one group of inventions is neither foreseen in the PCT 
phase nor does it constitute a suitable means In order to fomri a basis for further 
examination. 

In addition, according to the Search Examiner the sequence ID No, 16 did not even form 
part of the search since it was not included in the claims as filed. 

Finally, this Authority has serious doubts whether the amendments filed meet the 
requirements of Article 34(2)(b) PCT. 

Anyhow, in view of these considerations, the examination has to be can-ied out on the 

basis of the group indicated in the invitation i.e. the first sequence of Fig. 4, i.e. the claims 

insofar as they relate to glucuronidase from Staphylococcus. 

This group of alleged invention comprises claims 3, 21 and 37 all partially. 

These claims are the only claims which may constitute a basis for this opinion since no 

further claim refer to said claim. 

It should be made clear that certain claims could be made dependent on these claims e.g. 
method claims or vector claims etc (which at present is not the case). 

Re item V 

Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 

Examination is been carried out on the basis of the group of invention constituted by claims 
3, 21 and 37 all partially. 

The specific sequence encoding the glucuronidase from Staphylococcus is novel (Article 
33(2) PCT). 

This, however, does not necessarily apply for Variants thereof" which are only limited by 
the function. Since said function is common to known E. coli glucurondase, the claims in 
this respect include the subject-matter of the prior art and therefore lack novelty (Article 
33(2) PCT). 



Form PCT/Separaie SheeV409 (Sheei 1) (EPO-AprO 1997) 



19/07 01 THU 11:44 [TX/RX NO 6001] 



19. JUL. 2001 12:43 EPA MUENCHEN +49 89 23994465 NR. 7033 S. 1/1 

INTERNATIONAL Pf^HI^INARY Irrtemational applic^f^No. PCT/USOO/07107 
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In any case, even rf novelty were established by deleting the variants from the claims, an 
inventive step has to be denied. The reason is that In D1 (WILSON K J ET AL: The 
Escherichia coli gus operon: induction and expression of the gus operon in E. coll and the 
occurrence and use of GUS in other bacteria' GUS PROTOCOLS: USING THE GUS 
GENE AS A REPORTER OF GENE EXPRESSION, 1992. pages 7-22, XP002093517) 
the existence of the GUS activity In Staphylococcus is well documented. The isolation of 
the con-esponding gene by using the known DNA sequence from the E. coli enzyme (or 
the detemiination of conserved regions by comparison of the E. coll enzyme with other 
known GUS followed by the preparation of suitable oligo's, as has been done by the 
applicant) for screening the DNA of Staphylococcus merely requires routine 
experimentation but is devoid of any inventive merit. 

Therefore the claims do not meet the requirements of Article 33(3) POT. 
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made before the expiration of 19 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 
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MICROBIAL p-GLUCURONIDASE GENES, GENE PRODUCTS 
AND USES THEREOF 



5 TECHNICAL FIELD 

The present invention relates generally to microbial p-glucuronidases, 
and more specifically to secreted forms of p-glucuronidase, and uses of these p- 
glucuronidases. 

10 BACKGROUND OF THE IN'VENTION 

The enzyme p-glucuronidase (GUS; E.C.3.2.1.31) hydrolyzes a wide 
variety of glucuronides. Virtually any aglycone conjugated to D-glucuronic acid 
through a p-O-glycosidic linkage is a substrate for GUS. In vertebrates, glucuronides 
containing endogenous as well as xenobiotic compounds are generated through a major 

15 detoxification pathway and excreted in urine and bile. 

Escherichia coli, the major organism resident in the large intestine of 
vertebrates, utilizes the glucuronides generated in the liver and other organs as an 
efficient carbon source. Glucuronide substrates are taken up by E. coli via a specific 
transporter, the glucuronide permease (U.S. Patent No. 5,288,463 and 5,432,081), and 

20 cleaved by p-glucuronidase, releasing glucuronic acid residues that are used as a carbon 
source. In general, the aglycone component of the glucuronide substrate is not used by 
E. coli and passes back across the bacterial membrane into the gut to be reabsorbed into 
the bloodstream and undergo glucuronidation in the liver, beginning the cycle again. In 
E. coli, p-glucuronidase is encoded by the gusA gene (Novel and Novel, MoL Gen. 

25 Genet 720:319-335, 1973), which is one member of an operon comprising two other 
protein-encoding genes, gusB encoding a permease (PER) specific for p-glucuronides, 
and gusC encoding an outer membrane protein (OMP) that facilitates access of 
glucuronides to the permease located in the inner membrane. 

While p-glucuronidase activity is expressed in almost all tissues of 

30 vertebrates and their resident intestinal flora, GUS activity is absent in most other 
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organisms. Notably, plants, most bacteria, fungi, and insects are reported to largely, if 
not completely, lack GUS activity. Thus, GUS is ideal as a reporter molecule in these 
organisms and has become one of the most widely used reporter systems for these 
organisms. 

5 In addition, because both endogenous and xenobiotic compounds are 

generally excreted from vertebrates as glucuronides, P-glucuronidase is widely used in 
medical diagnostics, such as drug testing. In therapeutics, GUS has been used as an 
integral component of prodrug therapy. For example, a conjugate of GUS and a 
targeting molecules, such as an antibody specific for a tumor cell type, is delivered 

10 along with a nontoxic prodrug, provided as a glucuronide. The antibody targets the cell 
and GUS cleaves the prodrug, releasing an active drug at the target site. 

Because the E. coli GUS en2:yme is much more active and stable than the 
mammalian enzyme against most biosynthetically derived B-glucuronides (Tomasic and 
Keglevic, Biochem J J33:7S9, 1973; Lewy and Conchie, 1966), the E. coli GUS is 

15 preferred in both reporter and medical diagnostic systems. 

Production of GUS for use in in vitro assays, such as medical 
diagnostics, however, is costly and requires extensive manipulation as GUS must be 
recovered from cell ly sates. A secreted form of GUS would reduce manufacturing 
expenses, however, attempts to cause secretion have been largely unsuccessful. In 

20 addition, for use in transgenic organisms, the current GUS system has somewhat limited 
utility because enzymatic activity is detected intracellular! y by deposition of toxic 
colorimetric products during the staining or detection of GUS. Moreover, in cells that 
do not express a glucuronide permease, the cells must be permeabilized or sectioned to 
allow introduction of the substrate. Thus, this conventional staining procedure 

25 generally results in the destruction of the stained cells. In light of these limitations, a 
secreted GUS would facilitate development of non-destructive marker systems, 
especially useful for agricultural field work. 

Furthermore, the E. coli enzyme, although more robust than vertebrate 
GUS, has characteristics that limit its usefulness. For example, it is heat-labile and 
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inhibited by detergents and end product (glucuronic acid). For many applications, a 
more resilient enzyme would be beneficent. 

The present invention provides gene and protein sequences of microbial 
P-glucuronidases, variants thereof, and use of the proteins as a transformation marker, 
5 effector molecule, and component of mediced diagnostic and therapeutic systems, while 
providing other related advantages. 

SUMMARY OF INVENTION 

In one aspect, an isolated nucleic acid molecule is provided comprising a 

10 nucleic acid sequence encoding a microbial of p-glucuronidase, provided that the p- 
glucuronidase is not from E. coli. Nucleic acid sequences are provided for P- 
glucuronidases from Thermotoga, Staphylococcus^ Staphylococcus, Salmonella, 
Enterobacter, and Pseudomonas. In certain embodiments, the nucleic acid molecule 
encoding P-glucuronidase is derived from a eubacteria, such as purple bacteria, gram(+) 

15 bacteria, cyanobacteria, spirochaetes, green sulphur bacteria, bacteroides and 
flavobacteria, planctomyces, chlamydiae, radioresistant micrococci, and thermotogales. 

In another aspect, microbial p-glucuronidases are provided that have 
enhanced characteristics. In one aspect, thermostable P-glucuronidases and nucleic 
acids encoding them are provided. In general, a thermostable p-glucuronidase has a 

20 half-life of at least 10 min at 65°C. In preferred embodiments, the thermostable p- 
glucuronidase is from Thermotoga or Staphylococcus groups. In other embodiments, 
the P-glucuronidase converts at least 50 nmoles of p-nitrophenyl-glucuronide to p- 
nitrophenyl per minute, per microgram of protein. In even further embodiments, the p- 
glucuronidase retains at least 80% of its activity in 10 mM glucuronic acid. 

25 In another aspect, fusion proteins of microbial p-glucuronidase or an 

enzymatically active portion thereof are provided. In certain embodiments, the fusion 
partner is an antibody or fragment thereof that binds antigen. 

In other aspects, expression vectors comprising a gene encoding a 
microbial p-glucuronidase or a portion thereof that has enzymatic activity in operative 

30 linkage with a heterologous promoter are provided. In such a vector, the microbial p- 
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glucuronidase is not E. coli p-glucuronidase. In the expression vectors, the 
heterologous promoter is a promoter selected from the group consisting of a 
developmental type-specific promoter, a tissue type-specific promoter, a cell type- 
specific promoter and an inducible promoter. The promoter should be functional in the 
5 host cell for the expression vector. Examples of cell types include a plant cell, a 
bacterial cell, an animal cell and a fungal cell. In certain embodiments, the expression 
vector also comprises a nucleic acid sequence encoding a product of a gene of interest 
or portion thereof. The gene of interest may be under control of the same or a different 
promoter. 

10 Isolated forms of recombinant microbial p-glucuronidase are 3ilso 

provided in this invention, provided that the microbial p-glucuronidase is not E. coli p- 
glucuronidase. The recombinant p-glucuronidases may be from eubacteria, archaea, or 
eucarya. When eubacteria p-glucuromdases are clones, the eubacteria is selected from 
purple bacteria, gram(+) bacteria, cyanobacteria, spirochaetes, green sulphur bacteria, 

15 bacteroides and flavobacteria, planctomyces, chlamydiae, radioresistant micrococci, and 
thermotogales and the like. 

The present invention also provides methods for monitoring expression 
of a gene of interest or a portion thereof in a host cell, comprising: (a) introducing into 
the host cell a vector construct, the vector construct comprising a nucleic acid molecule 

20 according to clziim 1 and a nucleic acid molecule encoding a product of the gene of 
interest or a portion thereof; (b) detecting the presence of the microbial p-glucuronidase, 
thereby monitoring expression of the gene of interest; methods for transforming a host 
cell with a gene of interest or portion thereof, comprising: (a) introducing into the host 
cell a vector construct, the vector construct comprising a nucleic acid sequence 

25 encoding a microbial p-glucuronidase, provided that the microbial p-glucuronidase is 
not E. coli p-glucuronidase, and a nucleic acid sequence encoding a product of the gene 
of interest or a portion thereof, such that the vector construct integrates into the genome 
of the host cell; and (b) detecting the presence of the microbial p-glucuronidase, thereby 
establishing that the host cell is transformed. 
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Methods are also provided for positive selection for a transformed cell, 
comprising: (a) introducing into a host cell a vector construct, the vector construct 
comprising nucleic acid sequence encoding a microbial p-glucuronidase, provided that 
the microbial p-glucuronidase is not E. coli P-glucuronidase; (b) exposing the host cell 
5 to the sample comprising a glucuronide, wherein the glucuronide is cleaved by the p- 
glucuronidase, such that the compound is released, wherein the compound is required 
for cell growth. In all these methods, a microbial glucuronide permease gene may be 
also introduced. 

Transgenic plants expressing a microbial p-glucuronidase other than E. 

10 coli p-gluciu:onidase are also provided. The present invention also provides seeds of 
transgenic plants. Transgenic animals, such as aquatic animals are also provided. 
Methods for identifying a microorganism that secretes p-glucuronidase, are provided 
comprising: (a) culturing the microorganism in a medium containing a substrate for P- 
glucuronidase, wherein the cleaved substrate is detectable, and wherein the 

15 microorganism is an isolate of a naturally occurring microorganism or a transgenic 
microorganism; and (b) detecting the cleaved substrate in the medium. In certain 
embodiments, the microorganism is cultured under specific conditions that are 
favorable to particular microorganisms. 

In another aspect, a method for providing an effector compound to a cell 

20 in a transgenic plant is provided. The method comprises (a) growing a transgenic plant 
that comprises an expression vector, comprising a nucleic acid sequence encoding a 
microbial p-glucuronidase in operative linkage with a heterologous promoter and a 
nucleic acid sequence comprising a gene encoding a cell surface receptor for an effector 
compoimd and (b) exposing the transgenic plant to a glucuronide, wherein the 

25 glucuronide is cleaved by the P-glucuronidase, such that the effector compound is 
released. This method is especially useful for directing glucuronides to particular and 
specific cells by further introducing into the transgenic plant a vector construct 
comprising a nucleic acid sequence that binds the effector compound. The effector 
compoimd can then be used to control expression of a gene of interest by linking a gene 

30 of interest with the nucleic acid sequence that binds the effector compound. 
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These and other aspects of the present invention will become evident 
upon reference to the following detailed description and attached drawings. In addition, 
various references are set forth below which describe in more detail certain procedures 
or compositions {e.g.^ plasmids, etc.), and are therefore incorporated by reference in 
5 their entirety. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 presents DNA sequence of an approximately 6 kb fragment that 
encodes p-glucuronidase from Staphylococcus. 
10 Figure 2 is a schematic of the DNA sequence of a Staplrylococcus 6 kb 

fragment showing the location and orientation of the major open reading frames. 
S-GUS is p-glucuronidase. 

Figures 3A-B present amino acid sequences of representative microbial 
p-glucuronidases. 

15 Figures 4A-J present DNA sequences of representative microbial 

p-glucuronidases. 

Figures 5A-C present amino acid alignments of Staphylococcus GUS 
(SGUS) E. coll GUS (EGUS) and human GUS (HGUS)(5A). Microbial GUSes (5B) 
and nucleotide sequence alignments of Staphylococcus, Salmonella, and Pseudomonas 
20 p-glucuronidases. 

Figure 6 is a graph showing that Staphylococcus GUS is secreted in E. 
coli transformed with an expression vector encoding Staphylococcus GUS. The 
secretion index is the percent of total activity in periplasm less the percent of total p- 
galactosidase activity in periplasm. 
25 Figure 7 is a graph illustrating the half-life of Staphylococcus GUS and 

E. coli GUS at 65°C. 

Figure 8 is a graph showing the turnover number of Staphylococcus GUS 
and E. coli GUS enzymes at 37°C. 

Figure 9 is a graph showing the turnover nimiber of Staphylococcus GUS 
30 and E. coli GUS enzymes at room temperature. 
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Figure 10 is a graph presenting relative enzyme activity of 
Staphylococcus GUS in various detergents. 

Figure 11 is a graph presenting relative enzyme activity of 
Staphylococcus GUS in the presence of glucuronic acid. 
5 Figure 12 is a graph presenting relative enzyme activity of 

Staphylococcus GUS in various organic solvents and in salt. 

Figures 13A-C present a DNA sequence of Staphylococcus GUS that is 
codon-optimized for production in E. coli. 

Figure 14 is a schematic of the DNA sequence of Staphylococcus GUS 
10 that is codon-optimized for production in E. coli. 

Figure 15 presents schematics of two expression vectors for use in yeast 
(upper figure) and plants (lower figure). 

Figure 16 is a DNA sequence of a Salmonella gene p--glucuronidase. 
Figure 17 is an amino acid sequence of a Salmonella gene p- 
15 -glucuronidase translated fi-om the DNA sequence. 

Figure 18A-C presents an alignment of amino acids of three p- 
* -glucuronidase gene products: Staph (Staphylococcus), E. coli, Sal (a Salmonella). 

Figure 19A-G presents an alignment of nucleotides of three p- 
-glucuronidases; Staph {Staphylococcus), E. coli, Sal {Salmonella). 

20 

DETAILED DESCRIPTION OF THE INVENTION 

Prior to setting forth the invention, it may be helpful to an understanding 
thereof to set forth definitions of certain terms that will be used hereinafter. 

As used herein, "P-glucuronidase" refers to an enzyme that catalyzes the 
25 hydrolysis of p-glucuronides. Assays and some exemplary substrates for determining p 
—glucuronidase activity, also known as GUS activity, are provided in U.S. Patent 
No. 5,268,463. In assays to detect p-glucuronidase activity, fluorogenic or 
chromogenic substrates are preferred. Such substrates include, but are not limited to, p- 
nitrophenyl p-D-glucuronide and 4-methylumbelliferyl p-D-glucuronide. 
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As used herein, a "secreted form of a microbial p-glucuronidase" refers 
to a microbial p-glucuronidase that is capable of being localized to an extracellular 
environment of a cell, including extracellular fluids, periplasm, or is membrane bound 
on the external face of a cell but is not an integral membrane protein. Some of the 
protein may be found intracellularly. The amino acid and nucleotide sequences of 
exemplary secreted P-glucuronidases are presented in Figures 1 and 16 and SEQ ID 

Nos.: 1, 2, and . Secreted microbial GUS also encompasses variants 

of p-glucuronidase. A variant may be a portion of the secreted P-glucuronidase and/or 
have amino acid substitutions, insertions, and deletions, either found naturally as a 
polymorphic allele or constructed. A variant may also be a fusion of all or part of GUS 
with another protein. 

As used herein, "percent sequence identity" is a percentage determined 
by the number of exact matches of amino acids or nucleotides to a reference sequence 
divided by the number of residues in the region of overlap. Within the context of this 
invention, preferred amino acid sequence identity for a variant is at least 75% and 
preferably greater than 80%, 85%, 90% or 95%. Such amino acid sequence identity 
may be determined by standard methodologies, including use of the NationaJ Center for 
Biotechnology Information BLAST search methodology available at 
www.ncbi.nlm.nih.gov. The identity methodologies preferred are non-gapped BLAST. 
However, those described in U.S. Patent 5,691,179 and Altschul ei ai, Nucleic Acids 
Res. 25:3389-3402, 1997, all of which are incorporated herein by reference, are also 
useful. Accordingly, if Gapped BLAST 2.0 is utilized, then it is utilized with default 
settings. Further, a nucleotide variant will typically be sufficiently similar in sequence 
to hybridize to the reference sequence under stringent hybridization conditions (for 
nucleic acid molecules over about 500 bp, stringent conditions include a solution 
comprising about 1 M Na+ at 25° to 30°C below the Tm; e.g., 5 x SSPE, 0.5% SDS, at 
65°C; see, Ausubel, et aL, Current Protocols in Molecular Biology, Greene Publishing, 
1995; Sambrook et aL, Molecular Cloning: A Laboratory Manual^ Cold Spring Harbor 
Press, 1989), Some variants may not hybridize to the reference sequence because of 
codon degeneracy, such as degeneracies introduced for codon optimization in a 
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particular host, in which case amino acid identity may be used to assess similarity of the 
variant to the reference protein. 

As used herein, a "glucuronide" or "p-giucuronide" refers to an aglycone 
conjugated in a hemiacetal linkage, typically through the hydroxy 1 group, to the CI of a 
5 free D-glucuronic acid in the p configuration. Glucuronides include, but are not limited 
to, O-glucuronides linked through an oxygen atom, S-glucuronides, linked through a 
sulfur atom, N -glucuronides, linked through a nitrogen atom and C-glucxironides, linked 
through a carbon atom {see, Button, Glucuronidation of Drugs and Other Compounds, 
CRC Press, Inc. Boca Raton, FL ppl3-15). P-glucuronides consist of virtually any 

10 compound linked to the CI -position of glucuronic acid as a beta anomer, and are 
typically, though by no means exclusively, found as an O-glycoside. p-glucuronides 
are produced naturally in most vertebrates through the action of UDP-glucuronyl 
transferase as a part of the process of solubilizing, detoxifying, and mobilizing both 
natural and xenobiotic compounds, thus directing them to sites of excretion or activity 

15 through the circulatory system. 

p-glucuronides in polysaccharide form are also common in nature, most 
abundantly in vertebrates, where they are major constituents of connective and 
lubricating tissues in polymeric form with other sugars such as N-acetylglucosamine 
{e.g., chondroitan sulfate of cartilage, and hyaluronic acid, which is the principle 

20 constituent of synovial fluid and mucus). Other polysaccharide sources of p 
-glucuronides occur in bacterial cell walls, e.g., cellobiuronic acid. P-glucuronides are 
relatively uncommon or absent in plants. Glucuronides and galacturonides found in 
plant cell wall components (such as pectin) are generally in the alpha configuration, and 
are frequently substituted as the 4-O-methyl ether; hence, such glucuronides are not 

25 substrates for p-glucuronidase. 

An "isolated nucleic acid molecule" refers to a polynucleotide molecule 
in the form of a separate fragment or as a component of a larger nucleic acid construct, 
that has been separated from its source cell (including the chromosome it normally 
resides in) at least once in a substantially pure form. Nucleic acid molecules may be 
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comprised of a wide variety of nucleotides, including DNA, RNA, nucleotide 
analogues, have protein backbones (e.g., PNA) or some combination of these. 

Microbial p-glucuronidase genes 

5 As noted above, this invention provides gene sequences and gene 

products for microbial p-glucuronidases including secreted p-glucuronidases. As 
exemplified herein, genes from microorganisms, including genes from Staphylococcus 
and Salmonella that encode a secreted p-glucuronidase, are identified and characterized 
biochemically, genetically, and by DNA sequence analysis. Exemplary isolations of p- 

iO glucuronidase genes and gene products from several phylogenetic groups, including 
Staphylococcus, Thermotoga, Pseudomonas, Salmonella, Staphylococcus, 
Enterobacter, Arthobacter, and the like, are provided herein. Microbial p- 
-glucuronidases from additional organisms may be identified as described herein or by 
hybridization of one of the microbial p-glucuronidase gene sequence to genomic or 

15 cDNA libraries, by genetic complementation, by function, by amplification, by 
antibody screening of an expression library and the like {see Sambrook et al^ infra 
Ausubel et aL, infra for methods and conditions appropriate for isolation of a p- 
glucuronidase from other species). 

The presence of a microbial p-glucuronidase may be observed by a 

20 variety of methods and procedures. Particularly useful screens for identifying p- 
-glucuronidase are biochemical screening and genetic complementation. Test samples 
containing microbes, may be obtained from sources such as soil, animal or human skin, 
saliva, mucous, feces, water, and the like. Microbes present in such samples include 
organisms from the phylogenetic domains, Eubacteria, Archaea, and Eucarya (Woese, 

25 Microbiol Rev. 58: 1-9, 1994), the Eubacteria phyla: purple bacteria (including the a, 
p, y, and 5 subdivisions), gram (+) bacteria (including the high G+C content, low G+C 
content, and photosynthetic subdivisions), cyanobacteria, spirochaetes, green sulphur 
bacteria, bacteroides and flavobacteria, planctomyces and relatives, chlamydiae, 
radioresistant micrococci and relatives, and therm otogales. It will be appreciated by 

30 those in the art that the names and number of the phyla may vary somewhat according 
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to the precise criteria for categorization {see Strunk et ai. Electrophoresis J 9: 554, 
1998). Other microbes include, but are not limited to, entamoebae, fungi, and protozoa. 

Colonies of microorganisms are generally obtained by plating on a 
suitable substrate in appropriate conditions. Conditions and substrates will vary 
5 according to the growth requirements of the microorganism. For example, anaerobic 
conditions, liquid culture, or special defined media may be used to grow the 
microorganisms. Many different selective media have been devised to grow specific 
microorganisms (see, e.g, Merck Media Handbook). Substrates such as deoxycholate, 
citrate, etc. may be used to inhibit extraneous and undesired orgsinisms such as gram- 

10 positive cocci and spore forming bacilli. Other substances to identify particuigir 
microbes (e.g,, lactose fermenters, gram positives) may also be used. A glucuronide 
substrate is added that is readily detectable when cleaved by p-giucuronidase. If GUS is 
present, the microbes will stain; a microbe that secretes [^-glucuronidase should exhibit 
a diffuse staining (halo) pattern surrounding the colony. 

15 A complementation assay may be additionally performed to verify that 

the staining pattern is due to expression of a GUS gene or to assist in isolating and 
cloning the GUS gene. Briefly, in this assay, the candidate GUS gene is transfected into 
an £. coll strain that is deleted for the GUS operon (e.g., KWl described herein), and 
the staining pattern of the transfectant is compared to a mock-transfected host. For 

20 isolation of the GUS gene by complementation, microbial genomic DN A is digested by 
e.g.^ restriction enzyme reaction and ligated to a vector, which ideally is an expression 
vector. The recombinants are then transfected into a host strain, which ideally is deleted 
for endogenous GUS gene {e.g., KWl). In some cases, the host strain may express 
GUS gene but preferably not in the compartment to be assayed. If GUS is secreted, the 

25 transfectant should exhibit a diffuse staining pattern (halo) surrounding the colony, 
whereas, the host will not. 

The microorganisms can be identified in myriad ways, including 
morphology, virus sensitivity, sequence similarity, metabolism signatures, and the like. 
A preferred method is similarity of rRNA sequence determined after amplification of 

30 genomic DNA. A region of rRNA is chosen that is flanked by conserved sequences that 
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will anneal a set of amplification primers. The amplification product is subjected to 
DNA sequence analysis and compared to known rRNA sequences described. 

In one exemplary screen, a bacterial colony isolated fi-om a soil sample 
displays a strong, diffuse staining pattern. The bacterium was originally identified as a 
5 Staphylococcus by sequence determination of 16S rRNA after amplification. 
Additional 16S sequence information shows that this bacterium is a Siaphylococcms. A 
genomic library fi^om this bacterium is constructed in the vector pBSII KS+. The 
recombinant plasmids are transfected into KWl , a strain deleted for the p-glucuronidase 
operon. One resulting colony, containing the plasmid pRAJal7.1, exhibited a strong, 

10 difcuse staining pattern similar to the original isolate. 

In other exemplary screens of microorganisms found in soil and in skin 
samples, numerous microbes exhibit a diffuse staining pattem around the colony or 
stained blue. The phylogenetic classifications of some of these are determined by 
sequence analysis of 16S rRNA. At least eight different genera are represented. 

15 Genetic complementation assays demonstrate that the staining pattem is most likely due 
to expression of the GUS gene. Not all complementation assays yield positive results, 
however, which may be due to the. background genotype of the receptor strain or to 
restriction enzyme digestion within the GUS gene. The DNA sequence and predicted 
amino acid sequences of the GUS genes fi-om several of these microorganisms found in 

20 these screens microorganisms are determined. 

A DNA sequence of the GUS gene contained in the insert of pRAJal7,l 

is presented in Figure 1 and as SEQ ID No: . A schematic of the insert is presented 

in Figure 2. The p-glucuronidase gene contained in the insert is identified by similarity 
of the predicted amino acid sequence of an open reading frame to the E. coli and human 

25 p-giucuronidase amino acid sequences (Figure 5 A). Overall, Staphylococcus P- 
-glucuronidase has approximately 47-49% amino acid identity to E. coli GUS and to 
human GUS. An open reading frame of Staphylococcus GUS is 1854 bases, which 
would result in a protein that is 618 amino acids in length. The first methionine codon, 
however, is unlikely to encode the initiator methionine. Rather the second methionine 

30 codon is most likely the initiator methionine. Such a translated product is 602 amino 
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acids long and is the sequence presented in Figures 3A-B and 4A-I. The assignment of 
the initiator methionine is based upon a consensus Shine-Dalgamo sequence found 
upstream of the second Met, but not the first Met, and alignment of the Staphylococcus, 
human, and E. coli GUS amino acid sequences. Furthennore, as shown herein, 
5 Staphylococcus GUS gene lacking sequence encoding the 16 amino acids is expressed 
in E. coli transfectants. In addition, the 1 6 amino acids (Met-Leu-Ile-Ile-Thr-Cys-Asn- 

His-Leu-His-Leu-Lys-Arg-Ser-Ala-Ile) SEQ ID No. are not a canonical signal 

peptide sequence. 



10 3A-B) that can serve as a site for N-glycosylation in the ER. Furthermore, unlike the E. 
coli and human p-glucuronidases, which have 9 and 4 cysteines respectively, the 
Staphylococcus protein has only a single Cys residue (residue 499 in Figures 3A-B). 



identical. The nucleotide sequence and its amino acid translate are shovm in Figs 16 

1 5 and 1 7. There are 7 cysteines and a single glycosylation site (Asn-Leu-Ser) at residue 
358 (referenced to the E. coli sequence). Atmino acid alignments are shown in Figure 
18 and nucleotide alignments in Figure 19. Salmonella GUS has 71% nucleotide 
identity to E. coli, 51% to Staphylococcus and 85% amino acid identity to E. coli and 
46% to Staphylococcus. 

20 The DNA sequences of GUS genes from Staphylococcus homini. 

Staphylococcus warneri, Thermotoga maritima (TIGR Thermotoga database). 
Enter obacter^ Salmonella, and Pseudomonas are presented in Figures 4 A- J and SEQ ID 

Nos. . Predicted amino acid sequences are shown in Figures 3A-B and SEQ ID 

Nos. . The amino acid sequences are shown in alignment in Figures 5A-C. The 

25 signature peptide sequences for glycosyl hydrolases (Henrissat, Biochem Sac Trans 
26:153, 1998; Henrissat B et al, FEES Lett 27/425, 1998) are located from amino acids 
333 to 358 and firom amino acids 406 to 420 {Staphylococcus numbering in Figures 3 A 
and 5B). The catalytic nucleophile is Glu 344 {Staphylococcus numbering) (Wong et 
al,J. Biol Chem, 18\ 34057, 1998). Within these two signature regions, 17/26 and 8/15 

30 residues are identical across the six presented sequences. At the non -identical positions. 



There is a single Asn-Asn-Ser sequence (residues 118-120 in Figures 



Two GUS sequences from Salmonella are analysed and found to be 
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most of the sequences share an identical residue. Thus, the sequences are highly 
conserved in these regions (identity between Staphylococcus and each other GUS gene 
ranges from 65% to 100% in signature 1 and from 73% to 100% in signature 2) {see 
Figure 5B). In contrast, between Staphylococcus and p-galactosidase, another glycosyl 
5 hydrolase that has signature sequences, identity is 46% in signature 1 and 73% in 
signature 2, 

In addition, portions or fragments of microbial GUS may be isolated or 
constructed for use in the present invention. For example, restriction fragments can be 
isolated by well-known techniques from template DNA, e.g., plasmid DNA, and DNA 

10 fragments, including, but limited to, digestion with restriction enzymes or amplification. 
Furthermore, oligonucleotides of 12 to 100 nt, 12 to 50 nt, 15 to 50 nt, can be 
synthesized or isolated from recombinant DNA molecules. One skilled in the art will 
appreciated that other methods are available to obtain DNA or RNA molecules having 
at least a portion of a microbial GUS sequence. Moreover, for particular applications, 

15 these nucleic acids may be labeled by techniques known in the art, such as with a 
radiolabel {e.g., '^P, ''P, ''S, '^'l' "'I, ^ ''C), fluorescent label {e.g., FITC, Cy5, RITC, 
Texas Red), chemiluminescent label, enzyme, biotin and the like. 

In certain aspects, the present invention provides fragments of microbial 
GUS genes. Fragments may be at least 12 nucleotides long {e.g., at least 15 nt, 17 nt, 

20 20 nt, 25 nt, 30 nt, 40 nt, 50 nt). Fragments may be used in hybridization methods {see, 
exemplary conditions described infra) or inserted into an appropriate vector for 
expression or production. In certain aspects, the fragments have sequences of one or 
both of the signatures or have sequence from at least some of the more highly conserved 
regions of GUS {e.g., from approximately amino acids 272-360 and from amino acids 

25 398-421 or from amino eicids 398-545; based on Staphylococcus numbering in Figure 
5B). In the various embodiments, useftil fragments comprise those nucleic acid 
sequences which encode at least the active residue at amino acid position 344 
{Staphylococcus numbering in Figure 5B) and, preferably, comprise nucleic acid 
sequences 697-1624, 703-1620, 751-1573, 805-1398, 886-1248, 970-1059, and 997- 

30 1044 {Staphylococcus numbering in Figures 4A-4C). In other embodiments. 
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oligonucleotides of microbial GUSes are provided especially for use as amplification 
primers. In such case, the oligonucleotides are at least 12 bases and preferably at least 
15 bases (e,g., at least 18, 21, 25, 30 bases) and generally not longer than 50 bases. It 
will be appreciated that any of these fi*agments described herein can be double-stranded, 
5 single-stranded, derived from coding strand or complementary strand and be exact or 
mismatched sequence. 

Microbial ^-glucuronidase gene products 

The present invention also provides p-glucuronidase gene products in 
10 various forms. Forms of the GUS protein include, but are not limited to, secreted 
forms, membrane-bound forms, cytoplasmic forms, fusion proteins, chemical 
conjugates of GUS and another molecule, portions of GUS protein, and other variants. 
GUS protein may be produced by recombinant means, biochemical isolation, and the 
like. 

15 In certain aspects, variants of secreted microbial GUS are useful within 

the context of this invention. Variants include nucleotide or amino acid substitutions, 
deletions, insertions, and chimeras (e.g., fusion proteins). Typically, when the result of 
synthesis, amino acid substitutions are conservative, Le., substitution of amino acids 
within groups of polar, non-polar, aromatic, charged, etc. amino acids. As will be 

20 appreciated by those skilled in the art. a nucleotide sequence encoding microbial GUS 
may differ from the wild-type sequence presented in the Figures, due to codon 
degeneracies, nucleotide polymorphisms, or amino acid differences. In certain 
embodiments, variants preferably hybridize to the wild-type nucleotide sequence at 
conditions of normal stringency, which is approximately 25-30°C below Tm of the 

25 native duplex (e.g., 1 M Na+ at 65°C; e.g. 5X SSPE, 0.5% SDS, 5X Denhardt's 
solution, at 65°C or equivalent conditions; see generally, Sambrook et al Molecular 
Cloning: A Laboraiory Manual, 2nd ed.. Cold Spring Harbor Press, 1987; Ausubel ef 
ai. Current Protocols in Molecular Biology, Greene Publishing, 1987). Altematively, 
the Tm for other than short oligonucleotides can be calculated by the formula Tm=8l.5 

30 + 0.41%(G-i-C) - log[Na+]. Low stringency hybridizations are performed at conditions 
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approximately 40°C below Tm, and high stringency hybridizations are performed at 
conditions approximately 10°C below Tm. 

Variants may be constructed by any of the well known methods in the art 
(see, generally, Ausubel et aL, supra; Sambrook et al.^ stipra). Such methods include 
5 site-directed oligonucleotide mutagenesis, restriction enzyme digestion and removal or 
insertion of bases, amplification using primers containing mismatches or additional 
nucleotides, splicing of another gene sequence to the reference microbial GUS gene, 
and the like. Briefly, preferred methods for generating a few nucleotide substitutions 
utilize an oligonucleotide that spans the base or bases to be mutated and contains the 

10 mutated base or bases. The oligonucleotide is hybridized to complementar>' single 
stranded nucleic acid and second strand synthesis is primed from the oligonucleotide. 
Similarly, deletions and/or insertions may be constructed by any of a variety of known 
methods. For example, the gene can be digested with restriction enzymes and religated 
such that some sequence is deleted or ligated with an isolated fragment having cohesive 

15 ends so that an insertion or large substitution is made. In another embodiment, variants 
are generated by shuffling of regions (see U.S. Patent No. 5,605,793). Variant 
sequences may also be generated by "niolecular evolution" techniques (see U. S. Patent 
No. 5,723,323). Other means to generate variant sequences may be found, for example, 
in Sambrook et al {supra) and Ausubel et al {supra). Verification of variant sequences 

20 is typically accomplished by restriction enzyme mapping, sequence analysis, or probe 
hybridization, although other methods may be used. The double-stranded nucleic acid 
is transformed into host cells, typically E. coli, but alternatively, other prokaryotes, 
yeast, or larger eukaryotes may be used. Standard screening protocols, such as nucleic 
acid hybridization, amplification, and DNA sequence analysis, can be used to identify 

25 mutant sequences. 

In addition to directed mutagenesis in which one or a few amino acids 
are altered, variants that have multiple substitutions may be generated. The 
substitutions may be scattered throughout the protein or functional domain or 
concentrated in a small region. For example, a region may be mutagenized by 

3D oligonucleolide-directed mutagenesis in which the oligonucleotide contains a string of 
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dN bases or the region is excised and replaced by a string of dN bases. Thus, a 
population of variants with a randomized amino acid sequence in a region is generated. 
The variant with the desired properties (e.g., more efficient secretion) is then selected 
from the population. 

5 In preferred embodiments, the protein and variants are capable of being 

secreted and exhibit p-glucuronidase activity. A GUS protein is secreted if the amount 
of secretion expressed as a secretion index is statistically significantly higher for the 
candidate protein compared to a standard, typically E. coli GUS. Secretion index 
maybe calculated as the percentage of total GUS activity in periplasm or other 

10 extracellular environment less the percentile of total P-galactosidase activit>' found in 
the same extracellular environment. 

In other preferred embodiments, a microbial GUS or its variant will 
exhibit one or more of the biochemical characteristics exhibited by Staphylococcus 
GUS, such as its increased thermal stability, its higher turnover number, and its activity 

15 in detergents, presence of end product, high salt conditions and organic solvents as 
compared to an E. coli GUS standard. 

In certain preferred embodiments, the microbial GUS is themiostable, 
having a half-life of at least 10 minutes at 65 °C {e.g., at least 14 minutes, 16 minutes, 
1 8 minutes). In other preferred embodiments, GUS protein has a turnover number, 

20 expressed as nanomoles of p-nitrophenyl-p-D-glucuronide converted to p-nitrophenol 
per minute per fig of purified protein, of at least 50 and more preferably at least 60, at 
least 70, at least 80 and at least 90 nanomoles measured at its temperature optimum. In 
other preferred embodiments the turnover number is at least 20, at least 30, or at least 
40 nanomoles at room temperature. In yet other preferred embodiments, the p 

25 -glucuronidase should not be substantially inhibited by the presence of detergents such 
as SDS {e.g., at 0.1%, 1%, 5%), Triton® X-100 {e.g., at 0.1%, 1%, 5%), or sarcosyl 
{e.g., at 0.1%, 1%, 5%). In other preferred embodiments, the GUS enzyme is not 
substantially inhibited {e.g., less than 50% inhibition and more preferably less than 20% 
inhibition) by either 1 mM or as high as 1 0 mM glucuronic acid. In still other preferred 

30 embodiments, GUS retains substantial activity (at least 50% and preferably at least 
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70%) in organic solvents, such as dimethyl formamide, dimethylsulfoxide and in salt 



being secreted and exhibit one or more of the biochemical characteristics disclosed 
5 herein. In other embodiments, variants of microbial GUS are capable of binding to a 
hapten, such as biotin, dinitrophenol, and the like. 



without enzymatic activity or be directed to other cellular compartments, such as 
membrane or cytoplasm. Membrane-spanning amino acid sequences are generally 

10 hydrophobic and many examples of such sequences are well-known. These sequences 
may be spliced onto microbial secreted GUS by a variety of methods including 
conventional recombinant DNA techniques. Similarly, sequences that direct proteins to 
cytoplasm {e.g., Lys-Asp-Glu-Leu) may be added to the reference GUS, typically by 
recombinant DNA techniques. 

15 In other embodiments, a fusion protein comprising GUS may be 

constructed from the nucleic acid molecule encoding microbial and another nucleic acid 
molecule. As will be appreciated, the fiision partner gene may contribute, within certain 
embodiments, a coding region. In preferred embodiments, microbial GUS is fused to 
avidin, streptavidin or an antibody. Thus, it may be desirable to use only the catalytic 

20 site of GUS (e.g., amino acids 415-508 reference to Staphylococcus sequence). The 
choice of the fusion partner depends in part upon the desired application. The fusion 
partner may be used to alter specificity of GUS, provide a reporter function, provide a 
tag sequence for identification or purification protocols, and the like. The reporter or 
tag can be any protein that allows convenient and sensitive measurement or facilitates 

25 isolation of the gene product and does not interfere with the function of GUS. For 
example, green fluorescent protein and p-galactosidase are readily available as DNA 
sequences. A peptide tag is a short sequence, usually derived from a native protein, 
which is recognized by an antibody or other molecule. Peptide tags include FLAG®, 
Glu-Glu tag (Chiron Corp., Emeryville, OA), KT3 tag (Chiron Corp.), T7 gene 10 tag 

30 (Invitrogen, La JoUa, CA), T7 major capsid protein tag (Novagen, Madison, Wl), His^ 



(e.g., NaCl). 



In other preferred embodiments, GUS and variants thereof are capable of 



In other embodiments, variants may exhibit glucuronide binding activity 
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(hexa-His), and HSV tag (Novagen). Besides tags, other types of proteins or peptides, 
such as glutathione-S-transferase may be used. 

In other aspects of the present invention, isolated microbial 
glucuronidase proteins are provided. In one embodiment, GUS protein is expressed as a 
hexa-His fusion protein and isolated by metal-containing chromatography, such as 
nickel-coupled beads. Briefly, a sequence encoding HiSg is linked to a DNA sequence 
encoding a GUS. Although the His^ sequence can be positioned anywhere in the 
molecule, preferably it is linked at the 3' end immediately preceding the termination 
codon. The His-GUS fusion may be constructed by any of a variety of methods. A 
convenient method is amplification of the GUS gene using a downstream primer that 
contains the codons for His^. 

In one aspect of the present invention, peptides having microbial GUS 
sequence are provided. Peptides may be used as immunogens to raise antibodies, as 
well as other uses. Peptides are generally five to 100 amino acids long, and more 
usually 10 to 50 amino acids. Peptides are readily chemically synthesized in an 
automated fashion (e.g., PerkinElmer, ABI Peptide Synthesizer) or may be obtained 
commercially. Peptides may- be further purified by a variety of methods, including 
high-performance liquid chromatography (HPLC). Fiuthermore, peptides and proteins 
may contain amino acids other than the 20 naturally occurring amino acids or may 
contain derivatives and modification of the amino acids. 

p-glucuronidase protein may be isolated by standard methods, such as 
affinity chromatography using matrices containing saccharose lactone, phenythio- p 
-gluctironide, antibodies to GUS protein and the like, size exclusion chromatography, 
ionic exchange chromatography, HPLC, and other known protein isolation methods. 
(see generally Ausubel et al. supra', Sambrook ei al. supra). The protein can be 
expressed as a hexa-His fusion protein and isolated by metal-affinity chromatography, 
such as nickel-coupled beads. An isolated purified protein gives a single band on SDS- 
PAGE when stained with Coomassie brilliant blue. 
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Antibodies to microbial GUS 

Antibodies to microbial GUS proteins, fragments, or peptides discussed 
herein may readily be prepared. Such antibodies may si>ecifically recognize reference 
microbial GUS protein and not a mutant (or variant) protein, mutant (or variant) protein 
5 and not wild type protein, or equally recognize both the mutant (or variant) and wild- 
type forms. Antibodies may be used for isolation of the protein, inhibiting (antagonist) 
activity of the protein, or enhancing (agonist) activity of the protein. 

Within the context of the present invention, antibodies 2ire understood to 
include monoclonal antibodies, polyclonal antibodies, anti-idiotypic antibodies, 

10 antibody fragments (e.g.. Fab, and F(ab')2, Fy variable regions, or complementiirity 
determining regions). Antibodies are generally accepted as specific against GUS 
protein if they bind with a of greater than or equal to 10"^ M, preferably greater than 
of equal to 10"^ M. The affinity of a monoclonal antibody or binding partner can be 
readily determined by one of ordinary skill in the art {see Scatchard, Ann. N. Y. Acad. 

15 Sci. 51:660-672, 1949). 

Briefly, a polyclonal antibody preparation may be readily generated in a 
variety of warm-blooded animals such as rabbits, mice, or rats. Typically, an animal is 
immunized with GUS protein or peptide thereof, which may be conjugated to a carrier 
protein, such as keyhole limpet hemocyanin. Routes of administration include 

20 intraperitoneal, intramuscular, intraocular, or subcutaneous injections, usually in an 
adjuvant (e.g., Freund*s complete or incomplete adjuvant). Particularly preferred 
polyclonal antisera demonstrate binding in an assay that is at least three times greater 
than background. 

Monoclonal antibodies may also be readily generated from hybridoma 
25 cell lines using conventional techniques (see U.S. Patent Nos. RE 32,011, 4,902,614, 
4,543,439, and 4,41 1,993; see also Antibodies: A Laboratory Manual, Harlow and Lane 
(eds,). Cold Spring Harbor Laboratory Press, 1988). Briefly, within one embodiment, a 
subject animal such as a rat or mouse is injected with GUS or a portion thereof. The 
protein may be administered as an emulsion in an adjuvant such as Freimd's complete or 
30 incomplete adjuvant in order to increase the immune response. Between one and three 
weeks after the initial immunization the animal is generally boosted and may tested for 
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reactivity to the protein utilizing well-known assays. The spleen and/or lymph nodes 
are harvested and immortalized. Various immortalization techniques, such as mediated 
by Epstein-Barr virus or fusion to produce a hybridoma, may be used. In a preferred 
embodiment, immortalization occurs by fusion with a suitable myeloma cell line (e.g., 
5 NS-1 (ATCC No. TIB 18), and P3X63 - Ag 8.653 (ATCC No. CRL 1580) to create a 
hybridoma that secretes monoclonal antibody. The preferred fusion partners do not 
express endogenous antibody genes. Following fusion, the cells are cultured in 
selective medium and are subsequently screened for the presence of antibodies that are 
reactive against a GUS protein. A wide variety of assays may be utilized, including for 

10 example countercurrent immuno-electrophoresis, radioimmunoassays, 
radioimmunoprecipitations, enzyme-linked immunosorbent assays (ELISA), dot blot 
assays, western blots, immunoprecipitation, inhibition or competition assays, and 
sandwich assays {see U.S. Patent Nos. 4,376,1 10 and 4,486,530; see also Antibodies: A 
Laboratory Manual, Harlow and Lane (eds.). Cold Spring Harbor Laboratory Press, 

15 1988). 

Otiier techniques may also be utilized to construct monoclonal antibodies 
(see Huse al. Science 2^(5:1275-1281, 1989: Sastry et al, Proc. Natl. Acad Sci. 
USA 55.-5728-5732, 1989; Alting-Mees et al. Strategies in Molecular Biology 5:1-9, 
1990; describing recombinant techniques). Briefly, RNA is isolated from a B cell 

20 population and utilized to create heavy and light chain immunoglobulin cDNA 
expression libraries in suitable vectors, such as >-immimoZap(H) and A,lmmunoZap(L). 
These vectors may be screened individually or co-expressed to form Fab fragments or 
antibodies {see Huse et al., supra; Sastry et aL, supra). Positive plaques may 
subsequently be converted to a non-lytic plasmid that allows high level expression of 

25 monoclonal antibody fragments from E. coli. 

Similarly, portions or fragments, such as Fab and Fv fragments, of 
antibodies may also be constructed utilizing conventional enzymatic digestion or 
recombinant DNA techniques to yield isolated variable regions of an antibody. Within 
one embodiment, the genes which encode the variable region from a hybridoma 

30 producing a monoclonal antibody of interest are amplified using nucleotide primers for 
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the variable region, which may be purchased from commercially available sources (e.g. , 
Stratacyte, La Jolla, CA) Amplification products are inserted into vectors such as 
ImmunoZAP™ H or ImniunoZAP''^'*^ L (Stratacyte), which are then introduced into E. 
coli, yeast, or mammalian-based systems for expression. Utilizing these techniques, 
5 large amounts of a single-chain protein containing a fusion of the Vh and Vl domains 
may be produced (see Bird et al.y Science 2^2:423-426, 1988). In addition, techniques 
may be utilized to change a "murine" antibody to a "human" antibody, without altering 
the binding specificity of the antibody. 



10 techniques for generating antibodies exist. In this regard, the following U.S. patents 
teach a variety of these methodologies and are thus incorporated herein by reference: 



U.S. Patent Nos. 5,840,479; 5,770,380; 5,204,244; 5,482,856; 5,849,288; 5,780,225; 
5,395,750; 5,225,539; 5,110,833; 5,693,762; 5,693,761; 5,693,762; 5,698,435; and 



purified by many techniques well known to those of ordinary skill in the art (see 
^ Antibodies: A Laboratory Manual y Harlow and Lane (eds.). Cold Spring Harbor 
Laboratory Press, 1988). Suitable techniques include peptide or protein affinity 
colunms, HPLC (e.g., reversed phase, size exclusion, ion-exchange), purification on 
20 protein A or protein G columns, or any combination of these techniques. 

Assays for function of p-glucuronidase 



enzymatic activity and in other preferred embodiments, will also have the capability of 
25 being secreted. As noted above, variants of these reference GUS proteins may exhibit 
altered functional activity and cellular localization. Enzymatic activity may be assessed 
by an assay such as the ones disclosed herein or in U.S. Patent No. 5,268,463 
(Jefferson). Generally, a chromogenic or fluorogenic substrate is incubated with cell 
extracts, tissue or tissue sections, or purified protein. Cleavage of the substrate is 
30 monitored by a method appropriate for the aglycone. 



One of ordinary skill in the art will appreciate that a variety of alternative 



5,328,834. 



15 



Once suitable antibodies have been obtained, they may be isolated or 



In preferred embodiments, microbial p-glucuronidase will at least have 
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A variety of metbods may be used to demonstrate that a p-glucuronidase 
is secreted. For example, a rapid screening method in which colonies of organisms or 
cells, such as bacteria, yeast or insect cells, are plated and incubated with a readily 
visualized glucuronide substrate, such as X-GlcA. A colony with a diffuse staining 
5 pattern likely secretes GUS, although such a pattern could indicate that the cell has the 
ability to pump out the cleaved glucuronide, that the cell has become leaky, or that the 
enzyme is membrane bound. The unlikely alternatives can be ruled out by using a host 
cell for transfection that does not pump out cleaved substrate and is deleted for 
endogenous GUS genes is preferably used, 

10 Secretion of the enzj-me may be verified by assaying for GUS activity' in 

the extracellular environment. If the cells secreting GUS are gram-positive bacteria, 
yeasts, molds, plants, or other organisms with cell walls, activity may be assayed in the 
culture medium and in a cell extract, however, the protein may not be transported 
through the cell wall. Thus, if no or low activity of a secreted form of GUS is found in 

15 the culture medium, protoplasts made by osmotic shock or enzymatic digestion of the 
cell wall or other suitable procedure and the supernatant are assayed for GUS activity. 
If the cells secreting GUS are gram-negative bacteria, culture supematant is tested, but 
more likely p-glucuronidase will be retained in the periplasmic space between the iimer 
and outer membrane. In this case, spheroplasts, made by osmotic shock, enzymatic 

20 digestion, or other suitable procedure and the supematant are assayed for GUS activity. 
Cells without cell walls are assayed for GUS in cell supematant and cell extracts. The 
fraction of activity in each compartment is compared to the activity of a non-secreted 
GUS in the same or similar host cells. A p-glucuronidase is secreted if significantly 
more enzyme activity than E, coli GUS activity is found in extracellular spaces. The 

25 amount of secretion is generally normalized to the amount of a non-secreted protein 
found in extracellular spaces. By this assay, usually less than 10% of E. coli GUS is 
secreted- Within the context of this invention, higher amounts of secreted enzyme are 
preferred {e.g., greater than 20%, 25%, 30%, 40%, 50%). 

p-glucuronidases that exhibit specific substrate specificity are also useful 

30 within the context of the present invention. As noted above, glucuronides can be linked 
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through an oxygen, carbon, nitrogen or sulfur atom. Glucuronide substrates having 
each of the linkages may be used in one of the assays described herein to identify 
GUSes that discriminate among the linkages. In addition, various glucuronides 
containing a variety of aglycones may be used to identify. GUSes that discriminate 
5 among the aglycones. 

Some readily available glucuronides for testing include, but are not 

limited to: 

Phenyl-p-glucuronide 
Phenyl p-D-thio-glucuronide 
p-Nitrophenyl- P-glucuronide 

4- MethylumbelUferyl-p- glucuronide 
p-Aminophenyl-P-D-glucuronide 
p-Aminophenyl- 1 -thio-P-D-glucuronide 
Chloramphenicol P-D-glucuronide 
8-Hydroxyquinoline p-D-glucuronide 

5- Bromo-4-chloro-3-indoIyl-p-D-glucuronide pC-GlcA) 

5- Bronio-6-ch)oro-3-indo!yl-p-D-gIucuronide (Magenta-GIcA) 

6- Chloro-3-indolyl-P-D-glucuronide (Salmon-p-D-GlcA) 
Indoxyl-p-D-glucuronide (Y-GlcA) 
Androsterone-3 - p -D-glucuroni de 
a-Naphthyl-p-D-glucuronide 
Estriol-3-p-D-glucuronide 

17 -p-Estradiol-3-p-D-giucuronide 

Estrone-3 - p- D-g 1 ucuron ide 

Testosterone- 1 7- p-D-glucuronide 

1 9-nor-Testosterone- 1 7- p-D-glucuronide 

Tetrahydrocortisone-3-P-D-glucuronide 

Phenolphthalein-p-D-glucuronide 

3 -Azido-3'-deoxythymidine- P-D-glucuronide 

Methyl-p-D-glucuronide 

Morphine-6- p-D-glucuronide 

Vectors, host cells and means of expressing and producing protein 

10 Microbial p-glucuronidase may be expressed in a variety of host 

organisms. For protein production and purification, GUS is preferably secreted and 
produced in bacteria, such as E. coli^ for which many expression vectors have been 
developed and are available. Other suitable host organisms include other bacterial 
species (e.g.. Bacillus, and eukaryotes, such as yeast {e.g., Saccharomyces cerevisiae)^ 
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mammalian cells {e.g., CHO and COS-7), plant cells and insect cells {e.g., Sf9). 
Vectors for these hosts are well known. 

A DNA sequence encoding microbial p-glucuronidase is introduced into 
an expression vector appropriate for the host. The sequence is derived from an existing 
5 clone or synthesized. As described herein, a fragment of the coding region may be 
used, but if enzyme activity is desired, the catalytic region should be included. A 
preferred means of synthesis is amplification of the gene from cDNA, genomic DNA, or 
a recombinant clone using a set of primers that flank the coding region or the desired 
portion of the protein. Restriction sites are typically incorporated into the primer 

10 sequences and are chosen with regard to the cloning site of the vector. If necessary, 
translational initiation and termination codons can be engineered into the primer 
sequences. The sequence of GUS can be codon-optimized for expression in a particular 
host. For example, a secreted form of p-glucuronidase isolated from a bacterial species 
that is expressed in a fungal host, such as yeast, can be altered in nucleotide sequence to 

15 use codons preferred in yeast. Codon-optiinization may be accomplished by methods 
such as splice overlap extension, site-directed mutagenesis, automated synthesis, and 
the like. . ^ 

At minimum, an expression vector must contain a promoter sequence 
Other regulatory sequences may be included. Such sequences include a transcription 

20 termination signal sequence, secretion signal sequence, origin of replication, selectable 
marker, and the like. The regulatory sequences are operationally associated with one 
another to allow transcription or translation. 

Expression in bacteria 

25 The plasmids used herein for expression of secreted GUS include a 

promoter designed for expression of the proteins in a bacterial host. Suitable promoters 
are widely available and are well known in the art. Inducible or constitutive promoters 
are preferred. Such promoters for expression in bacteria include promoters from the T7 
phage and other phages, such as T3, T5, and SP6, and the trp, Ipp, and lac operons. 

30 Hybrid promoters {see, U.S. Patent No, 4,551,433), such as tac and trc, may also be 
used. Promoters for expression in eukaryotic cells include the PIO or polyhedron gene 



wo 00/55333 




PCT/USOO/07107 



26 



promoter of baculovirus/insect cell expression systems {see^ e.g., U.S. Patent Nos. 
5^243,041, 5,242,687, 5,266,317, 4,745,051, and 5,169,784), MMTV LTR, RSV LTR, 
SV40, metallothionein promoter (see, e.g., U.S. Patent No. 4,870,009) and other 
inducible promoters. For protein expression, a promoter is inserted in operative linkage 
5 with the coding region for p-glucuronidase. 



controlled by a repressor. In some systems, the promoter can be derepressed by altering 
the physiological conditions of the cell, for example, by the addition of a molecule that 
competitively binds the repressor, or by altering the temperature of the growth media. 
10 Preferred repressor proteins include, but are not limited to the E. coli iacl repressor 
responsive to IPTG induction, the temperature sensitive Ax;I857 repressor, and the like. 
The E. coli lad repressor is preferred. 



terminator sequence. A "transcription terminator region" has either a sequence that 
15 provides a signal that terminates transcription by the polymerase that recognizes the 
selected promoter and/or a signal sequence for polyadenylation. 



bacterial hosts, the vector preferably contains a bacterial origin of replication. Preferred 
bacterial origins of replication include the fl-ori and col El origins of replication, 
20 especially the origin derived from pUC plasmids. 



functional in the host. A selectable gene includes any gene that confers a phenotype on 
the host that allows transformed cells to be identified and selectively grown. Suitable 
selectable marker genes for bacterial hosts include the ampicillin resistance gene 
25 (AmpO, tetracycline resistance gene (TcO and kanamycin resistance gene (KanO* 
Suitable markers for eukaryotes usually complement a deficiency in the host (e.g.., 
thymidine kinase (tk) in tk- hosts). However, drug markers are also available (e.g., 
G418 resistance and hygromycin resistance). 



30 a classical secretion signal, whereby the resulting peptide is a precursor protein 



The promoter controlling transcription of p-glucuronidase may be 



In other preferred embodiments, the vector also includes a transcription 



Preferably, the vector is capable of replication in host cells. Thus, for 



The plasmids also preferably include at least one selectable gene that is 



The sequence of nucleotides encoding p-glucuronidase may also include 
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processed and secreted. The resulting processed protein may be recovered from the 
periplasmic space or the fermentation medium. Secretion signals suitable for use are 
widely avzdlable and are well known in the art (von Heijne, J. Mol. Biol. J 84:99-105, 
1985). Prokaryotic and eukaryotic secretion signals that are functional in E. coli (or 
5 other host) may be employed. The presently preferred secretion signals include, but are 
not limited to pelB, mata, extensin and glycine-rich protein. 

One skilled in the art appreciates that there are a wide variety of suitable 
vectors for expression in bacterial cells and which are readily obtainable. Vectors such 
as the pET series (Novagen, Madison, WI) and the tac and trc series (Pharmacia, 
10 Uppsala, Sweden) are suitable for expression of a P-glucuronidase. A suitable plasmid 

is ampicillin resistant, has a colEI origin of replication, lacl^ gene, a lac/trp hybrid 

promoter in front of the lac Shine-Dalgamo sequence, a hexa-his coding sequence that 
joins to the 3' end of the inserted gene, and an rmB terminator sequence. 

The choice of a bacterial host for the expression of a p-glucuronidase is 
15 dictated in part by the vector. Commercially available vectors are paired with suitable 
hosts. The vector is introduced in bacterial cells by standard methodology. Typically, 
bacterial ceils are treated to allow uptake of DNA (for protocols, see generally, Ausubel 
et aL, supra; Sambrook et ai, supra). Alternatively, the vector may be introduced by 
electroporation, phage infection, or another suitable method. 

20 

Expression in plant cells 

As noted above, the present invention provides vectors capable of 
expressing microbial secreted p-glucuronidase and secreted microbial p-glucuronidases. 
For agricultural applications, the vectors should be functional in plant cells. Suitable 
25 plants include, but are not limited to, wheat, rice, com, soybeans, lupins, vegetables, 
potatoes, canola, nut trees, coffee, cassava, yam, alfalfa and other forage plants, cereals, 
legumes and the like. In one embodiment, rice is a host for GUS gene expression. 

Vectors that are functional in plants are preferably binary plasmids 
derived from Agrobacterium plasmids. Such vectors are capable of transforming plant 
30 cells. These vectors contain left and right border sequences that are required for 
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integration into the host (plant) chromosome. At minimum, between these border 
sequences is the gene to be expressed under control of a promoter. In preferred 
embodiments, a selectable gene is also included. The vector also preferably contains a 
bacterial origin of replication for propagation in bacteria. 



with a promoter that is functional in a plant cell. Typically, the promoter is derived 
from a host plant gene, but promoters from other plant species and other organisms, 
such as insects, fungi, viruses, mammals, and the like, may also be suitable, and at times 
preferred. The promoter may be constitutive or inducible, or may be active in a certain 

10 tissue or tissues (tissue type-specific promoter), in a certain cell or cells (cell-type 
specific promoter), of at a particular stage or stages of development (development-type 
specific promoter). The choice of a promoter depends at least in part upon the 
application. Many promoters have been identified and isolated (e.g., CAMV35S 
promoter, maize Ubiquitin promoter) (see, generally, GenBank and EMBL databases). 

15 Other promoters may be isolated by well-known methods. For example, a genomic 
clone for a particular gene can be isolated by probe hybridization. The coding region is 
mapped by restriction mapping, DNA sequence analysis, RNase probe protection, or 
other suitable method. The genomic region immediately upstream of the coding region 
comprises a promoter region and is isolated. Generally, the promoter region is located 

20 in the first 200 bases upstream, but may extend to 500 or more bzises. The candidate 
region is inserted in a suitable vector in operative linkage with a reporter gene, such as 
in pBI121 in place of the CaMV 35S promoter, and the promoter is tested by assaying 
for the reporter gene after transformation into a plant cell, (see, generally, Ausubel et 
al, supra\ Sambrook et al.^ supra; Methods in Plant Molecular Biology and 

25 Biotechnology, Ed. Click and Thompson, CRC Press, 1 993 .) 



transformants. The selectable marker preferably confers a growth advantage under 
appropriate conditions. Generally, selectable markers are drug resistance genes, such as 
neomycin phosphotransferase. Other drug resistance genes are known to those in the art 
30 and may be readily substituted. Selectable markers include, ampicillin resistance. 



5 



A gene for microbial p-glucuronidase should be in operative linkage 



Preferably, the vector contains a selectable marker for identifying 
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tetracycline resistance, kanamycin resistance, chloramphenicol resistance, and the like. 
The selectable marker also preferably has a linked constitutive or inducible promoter 
and a termination sequence, including a polyadenylation signal sequence. Other 
selection systems, such as positive selection can alternatively be used (U.S. Patent 

5 Nos. ). 

The sequence of nucleotides encoding p-glucuronidase may also include 
a classical secretion signal, whereby the resulting peptide is a precursor protein 
processed and secreted. Suitable signal sequences of plant genes include, but are not 
limited to the signal sequences from glycine-rich protein and extensin. In addition, a 
10 glucuronide permease gene to faciUtate uptake of glucuronides may be co-transfected 
either from the same vector containing microbial GUS or from a separate expression 
vector. 

A general vector suitable for use in the present invention is based on 
pBI121 (U.S. Patent No. 5,432,081) a derivative of pBIN19. Other vectors have been 

15 described (U.S. Patent Nos. 4,536,475; 5,733,744; 4,940,838; 5,464,763; 5,501,967; 
5,731,179) or may be constructed based on the guidelines presented herein. The 
plasmid pBI121 contains a left and right border sequence for integration into a plant 
host chromosome and also contains a bacterial origin of replication and selectable 
marker. These border sequences flank two genes. One is a kanamycin resistance gene 

20 (neomycin phosphotransferase) driven by a nopaline synthase promoter and using a 
nopaline synthase polyadenylation site. The second is the coli GUS gene (reporter 
gene) under control of the CaMV 35S promoter and polyadenlyated using a nopaline 
synthase polyadenylation site. The E. coli GUS gene is replaced with a gene encoding a 
secreted form of p-glucuronidase. If appropriate, the CaMV 35S promoter is replaced 

25 by a different promoter. Either one of the expression units described above is 
additionally inserted or is inserted in place of the CaMV promoter and GUS gene. 

Plants may be transformed by any of several methods. For example, 
plasmid DNA may be introduced by Agrobacterium co-cultivation {e.g., U.S. Patent 
No. 5,591,616; 4,940,838) or bombardment {e.g., U.S. Patent No. 4,945,050; 5,036,006; 

30 5,100,792; 5,371,015). Other transformation methods include electroporation (U.S. 
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Patent No. 5,629,183), CaP04-mediated transfection, gene transfer to protoplasts 
(AUB 600221), microinjection, and the like {see. Gene Transfer to Plants, Ed. 
Potrykus and Spangenberg, Springer, 1995, for procedures). Preferably, vector DNA is 
first transfected into Agrobacterium and subsequently introduced into plant cells. Most 

5 preferably, the infection is achieved by Agrobacterium co-cultivation. In part, the 
choice of transformation methods depends upon the plant to be transformed. Tissues 
can alternatively be efficiently infected by Agrobacterium utilizing a projectile or 
bombardment method. Projectile methods are generally used for transforming 
sunflowers and soybean. Bombardment is often used when naked DNA, typically 

10 Agrobacterium binary plasmids or pUC-based plasmids, is used for transformation or 
transient expression. 



by fi-eeze-thaw method (Holsters et al, Mol Gen. Genet, 163: 181-187, 1978) or by 
other suitable methods (see, Ausubel, et al. supra; Sambrook et ai, supra). Briefly, a 

15 culture of Agrobacterium containing the plasmid is incubated with leaf disks, 
protoplasts, meristematic tissue, or calli to generate transformed plants (Bevan, Nucl 
Acids. Res. 72:871 1, 1984) (U.S. Patent No. 5,591,616). After co-cultivation for about 
2 days, bacteria are removed by washing and plant cells are transferred to plates 
containing antibiotic {e.g., cefotzixime) and selecting medium. Plant cells are further 

20 incubated for several days. The presence of the transgene may be tested for at this time. 
After further incubation for several weeks in selecting medium, calli or plant cells are 
transferred to regeneration mediimi and placed in the light. Shoots are transferred to 
rooting medium and then into glass house. 



25 produce a clean fracture at the plane of the embryonic axis, which are placed cut surface 
up on medium with growth regulating hormones, minerals and vitamin additives. 
Explants from other tissues or methods of preparation may alternatively be used. 
Explants are bombarded with gold or tungsten microprojectiles by a particle 
acceleration device and cultured for several days in a suspension of transformed 

30 Agrobacterium. Explants are transferred to medium lacking growth regulators but 



Briefly, co-cultivation is performed by first transforming Agrobacterium 



Briefly, for microprojectile bombardment, cotyledons are broken off to 
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containing drug for selection and grown for 2-5 weeks. After 1 -2 weeks more without 
drug selection, leaf samples fix>m green, drug-resistant shoots are grafted to in vitro 
grown rootstock and transferred to soil. 

A positive selection system, such as using cellobiuronic acid and culture 
5 medium lacking a carbon source, is preferably used (see, co-pending application no. 
09/130,695). 

Activity of secreted GUS is conveniently assayed in whole plants or in 
selected tissues using a glucuronide substrate that is readily detected upon cleavage. 
Glucuronide substrates that are colorimetric are preferred. Field testing of plants may 
10 be performed by spraying a plant with the glucuronide substrate and observing color 
formation of the cleaved product. 

Classical tests for a transgene such as Southern blotting and 
hybridization or genetic segregation can also be performed. 

IS Expression in other organisms 

A variety of other organisms are suitable for use in the present invention. 
For example, various fimgi, including yeasts, molds, and mushrooms, insects, especially 
vectors for diseases and pathogens, and other animals, such as cows, mice, goats, birds, 
aquatic animals (e.g., shrimp, turtles, fish, lobster and other crustaceans), amphibians 

20 and reptiles and the like, may be transformed with a GUS transgene. 

The principles that guide vector construction for bacteria and plants, as 
discussed above, are applicable to vectors for these organisms. In general, vectors are 
well known and readily available. Briefly, the vector should have at least a promoter 
functional in the host in operative linkage with GUS. Usually, the vector will also have 

25 one or more selectable markers, an origin of replication, a polyadenylation signal and 
transcription terminator. 

The sequence of nucleotides encoding P-glucuronidase may also include 
a classical secretion signal, whereby the resulting peptide is a precursor protein 
processed and secreted. Suitable secretion signals may be obtained from a variety of 

30 genes, such as mat-alpha or invertase genes. In addition, a permease gene may be co- 
transfected. 
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One of ordinary skill in the art will appreciate that a variety of 
techniques for producing transgenic animals exist. In this regard, the following U.S. 
patents teach such methodologies and are thus incorporated herein by reference: U.S. 
Patent Nos. 5,162,215; 5,545,808; 5,741,957; 4,873,191; 5,780,009; 4,736,866; 
5 5,567,607; and 5,633,076. 

Uses of microbial p-glucuronidase 

As noted above, microbial p-glucuronidase may be used in a variety of 
applications. In certain aspects, microbial p-glucuronidase can be used as a 

10 reporter/effector molecule and as a diagnostic tool. As taught herein, microbial p- 
glucuronidase that is secretable is preferred as an in vivo reporter/effector molecule, 
whereas, in in vitro diagnostic applications, the biochemical characteristics of the p- 
glucuronidase disclosed herein (e.g., thermal stability, high turnover number) may 
provide preferred advantages. 

15 Microbial GUS, either secreted or non-secreted, can be used as a 

marker/effector for transgenic constructions. In a certain embodiments, the transgenic 
host is a plant, such as rice, com, wheat, or an aquatic animal. The transgenic GUS may 
be iised in at least three ways: one in a method of positive selection, obviating the need 
for drug resistance selection, a second as a system to target molecules to specific cells, 

20 and a third as a means of detecting and tracking linked genes. 

For positive selection, a host cell, {e.g., plant cells) is transformed with a 
GUS (preferably secretable GUS) transgene. Selection is achieved by providing the 
cells with a glucuroni dated form of a required nutrient (U.S. Patent Nos 5,994,629; 
5,767,378; PCX US99/17804). For example, all cells require a carbon source, such as 

25 glucose. In one embodiment, glucose is provided as glucuronyl glucose (cellobiuronic 
acid), which is cleaved by GUS into glucose plus glucuronic acid. The glucose would 
then bind to receptors and be taken up by cells. The glucuronide can be any required 
compound, including without limitation, a cytokinin, auxin, vitamin, carbohydrate, 
nitrogen-containing compound, and the like. It will be appreciated that this positive 

30 selection method can be used for cells and tissues derived from diverse organisms, such 
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as animal cells, insect cells, fiingi, and the like. The choice of glucuronide will depend 
in part upon the requirements of the host cell. 

As a marker/effector molecule, secreted GUS (s-GUS) is preferred 
because it is non-destructive, that is, the host does not need to be destroyed in order to 
5 assay enzyme activity. A non-destructive marker has special utility as a tool in plant 
breeding. The GUS enzyme can be used to detect and track linked endogenous or 
exogenous! y introduced genes. GUS may also be used to generate sentinel plants that 
serve as bioindicators of environmental status. Plant pathogen invasion can be 
monitored if GUS is under control of a pathogen promoter. In addition, such transgenic 

iO plants may serve as a model system for screening inhibitors of pathogen invasion. In 
this system, GUS is expressed if a pathogen invades. In the presence of an effective 
inhibitor, GUS activity will not be detectable. In certain embodiments, GUS is co- 
transfected with a gene encoding a glucuronide permease. 

Preferred transgenes for introduction into plants encode proteins that 

15 aifect fertility, including male sterility, female fecundity, and apomixis; plant protection 
genes, including proteins that confer resistance to diseases, bacteria, fungus, nematodes, 
viruses and insects; genes and proteins that affect developmental processes or confer 
new phenotypes, such as genes that control meristem development, timing of flowering, 
cell division or senescence (e.g., telomerase) toxicity (e.g., diphtheria toxin, saporin) 

20 affect membrane permeability (e.g., glucuronide permease (U.S. Patent No. 5,432,08 1)), 
transcriptional activators or repressors, and the like. 

Insect and disease resistance genes are well known. Some of these genes 
are present in the genome of plants and have been genetically identified. Others of 
these genes have been found in bacteria and are used to confer resistance. 

25 Particularly well known insect resistance genes are the crystal genes of 

Staphylococcus thuringiensis. The crystal genes are active against various insects, such 
as lepidopterans, Diptera^ Hemiptera and Coleoptera. Many of these genes have been 
cloned. For examples, see, GenBank; U.S. Patent Nos. 5,317,096; 5,254,799; 
5,460,963; 5,308,760, 5,466,597, 5,2187,091, 5,382,429, 5,164,180, 5,206,166, 

30 5,407,825, 4,918,066. Gene sequences for these and related proteins may be obtained 
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by standard and routine technologies, such as probe hybridization of a B. thtiringiensts 
library or amplification {see generally, Sambrook et al., supra, Ausubel et al supra). 
The probes and primers may be synthesized based on publicly available sequence 
information. 

5 Other resistance genes to Sclerotinia, cyst nematodes, tobacco mosaic 

virus, flax and crown rust, rice blast, powdery mildew, verticillum wilt, potato beetle, 
aphids, as well as other infections, are useful within the context of this invention. 
Examples of such disease resistance genes may be isolated from teachings in the 
following references: isolation of rust disease resist£ince gene from flax plants (WO 

10 95/29238); isolation of the gene encoding Rps2 protein from Arabidopsis thaliana that 
confers disease resistance to pathogens carrying the avrRpt2 avirulence gene (WO 
95/28478); isolation of a gene encoding a lectin-Hke protein of kidney bean confers 
insect resistance (JP 71-32092); isolation of the Hml disease resistance gene to C 
carbonum from maize (WO 95/07989); for examples of other resistance genes, see WO 

15 95/05743; U.S. Patent No. 5,496,732; U.S. Patent No, 5,349,126, EP 616035; EP 
392225; WO 94/18335; JP 43-20631; EP 502719; WO 90/11770; U.S. Patent 
5,270,200; U.S. Patent Nos. 5,218,104 and 5,306.863). In addition, general methods for 
identification and isolation of plant disease resistance genes are disclosed (WO 
95/28423). Any of these gene sequences suitable for insertion in a vector according to 

20 the present invention may be obtained by standard recombinant technology techniques, 
such as probe hybridization or amplification. When amplification is performed, 
restriction sites suitable for cloning are preferably inserted. Nucleotide sequences for 
other transgenes, such as controlling male fertility, are found in U.S. Patent No. 
5,478,369, references therein, and Mariani et al. Nature 347\11>1, 1990. 

25 In similar fashion, microbial GUS, preferably secreted, can be used to 

generate transgenic insects for tracking insect populations or facilitate the development 
of a bioassay for compounds that affect molecules critical for insect development (e.g., 
juvenile hormone). Secreted GUS may also serve as a marker for beneficial fungi 
destined for release into the environment. The non-destructive marker is useful for 

30 detecting persistence and competitive advantage of the released organisms. 
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In animal systems, secreted GUS may be used to achieve extracellular 
detoxification of glucuronides (e.g, toxin giucuronide) and examine conjugation 
patterns of glucuronides. Furthermore, as discussed above, secreted GUS may be used 
as a transgenic marker to track cells or as a positive selection system, or to assist in 
development of new bioactive GUS substrates that do not need to be transported across 
membrane- Aquatic animals are suitable hosts for GUS transgene. GUS may be used 
in these animals as a marker or effector molecule. 

Within the context of this invention, GUS may also be used in a system 
to target molecules to cells. This system is particularly useful when the molecules are 
hydrophobic and thus, not readily delivered. These molecules can be useful as effectors 
(e.g., inducers) of responsive promoters. For example, molecules such as ecdysone are 
hydrophobic and not readily transported through phloem in plants. When ecdysone is 
glucuroni dated it becomes amphipathic and can be delivered to cells by way of phloem. 
Targeting of compounds such as ecdysone-glucuronic acid to cells is accomplished by 
causing cells to express receptor for ecdysone. As ecdysone receptor is naturally only 
expressed in insect cells, however a host cell that is transgenic for ecdysone receptor 
will express it. The giucuronide containing ecdysone then binds only to cells 
expressing the receptor. If these cells also express GUS, ecdysone will be released from 
the giucuronide and able to induce expression from an ecdysone-responsive promoter. 
Plasmids containing ecdysone receptor genes and ecdysone responsive promoter can be 
obtained from Invitrogen (Carlsbad, CA), Other ligand-receptors suitable for use in this 
system include glucocorticoids/glucocorticoid receptor, estrogen/estrogen receptor, 
antibody and antigen, and the like {see also U.S. Patent Nos. 5,693,769 and 5,612,317). 

In another aspect, purified microbial p-glucuronidase is used in medical 
applications. For these applications, secretion is not a necessary characteristic although 
it may be a desirable characteristic for production and purification. The biochemical 
attributes, such as the increased stability and enzymatic activity disclosed herein are 
preferred characteristics. The microbial glucuronidase preferably has one or more of 
the disclosed characteristics. 
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For the majority of drug or pharmaceutical analysis, the compounds in 
urine, blood, sziliva, or other bodily fluids are de-glucuronidated prior to analysis. Such 
a procedure is undertaken because compounds are often, if not nearly always, detoxified 
by glucuronidation in vertebrates. Thus, drugs that are in circulation and have passed 
5 through a site of glucuronidation (e.g.. liver) are found conjugated to glucuronic acid. 
Such glucuronides yield a complex pattern upon analysis by, for example, HPLC. 
However, after the aglycone (drug) is cleaved from the glucuronic acid, a spectrum can 
be compared to a reference spectrum. Currently, E. coli GUS is utilized in medical 
diagnostics, but as shown herein, microbial GUS, e.g. Staphylococcus GUS has superior 
10 qualities. 

The microbial GUS enzymes disclosed herein may be used in traditional 
medical diagnostic assays, such as described above for drug testing, pharmacokinetic 
studies, bioavailability studies, diagnosis of diseases and syndromes, following 
progression of disease or its response to therapy and the like {see U.S. Patent Nos. 

15 5,854,009, 4,450,239, 4,274,832, 4,473,640, 5,726,031, 4,939,264, 4,115,064, 
4,892,833). These p-glucuronidase enzymes may be used in place of other traditional 
en2:ymes (e.g., alkaline phosphatase, horseradish peroxidase, beta-galactosidase, and the 
like) and compounds {e.g., green fluorescent protein, radionuclides) that serve as 
visualizing agents. Microbial GUS has qualities advantageous for use as a visualizing 

20 agent: it is highly specific for the substrate, water soluble and the substrates are stable- 
Thus, microbial GUS is suitable for use in Southern analysis of DNA, Northern 
analysis, ELISA, and the like. 

In preferred embodiments, microbial GUS binds a hapten, either as a 
fusion protein with a partner protein that binds the hapten {e.g., avidin that binds biotin, 

25 antibody) or alone. If used alone, microbial GUS can be mutagenized and selected for 
hapten-binding abilities. Mutagenesis and binding assays are well known in the art. In 
addition, microbial GUS can be conjugated to avidin, streptavidin, antibody or other 
hapten binding protein and used as a ref>orter in the myriad assays that currently employ 
enzyme-linked binding proteins. Such assays include immimoassays. Western blots, in 

30 situ hybridizations, HPLC, high-throughput binding assays, and the like (see, for 
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examples, U.S. Patent Nos. 5,328,985 and 4,839,293, which teach avidin and 
streptavidin fusion proteins and U.S. Patent No. 4,298,685, Diamandis and 
Christopoulos, Clin, Chem. 37:625, 1991; Richards, Methods EnzymoL 184:3, 1990; 
Wilchek and Bayer, Methods En2ymol 184:467, 1990; Wilchek and Bayer, Methods 
5 EnzymoL J 84:5, 1990; Wilchek and Bayer, Methods EnzymoL }84:\4, 1990; Dunn, 
Methods MoL BioL 32:221, 1994; B loch, J. Hitochem. Cytochem, 41:1751, 1993; Bayer 
and Wilchek y. Chromatogr. 510:3, 1990, which teach various applications of enzyme- 
linked technologies and methods). 



10 glucuroni dating compoimds such as drugs, the compound is inactivated. When a 
glucuronidase is expressed or targeted to the site for delivery, the glucuronide is cleaved 
and the compound delivered. For these purposes, GUS may be expressed as a transgene 
or delivered, for example, coupled to an antibody specific for the target cell {see e.g.. 



U.S. Patent Nos. 5,075,340, 4,584,368, 4,481,195, 4,478,936, 5,760,008, 5,639,737, 
15 4,588,686). 



protein or expression vectors containing microbiaLGUS gene. One exemplary type of 
kit is a dipstick test. Such tests are widely utilized for establishing pregnancy, as well 
as other conditions. Generally, these dipstick tests assay the glucuronide form, but it 

20 would be advantageous to use reagents that detect the aglycone form. Thus, GUS may 
be immobilized on the dipstick adjacent to or mixed in with the detector molecule {e.g., 
antibody). The dipstick is then dipped in the test fluid {e.g., urine) and as the 
compovinds flow past GUS, they are cleaved into aglycone and glucuronic acid. The 
aglycone is then detected. Such a setup may be extremely useful for testing compounds 

25 that are not readily detectable as glucuronides. 



bind a glucuronide, but lack enzymatic activity. The enzyme will then bind the 
glucuronide and the enzyme is detected by standard methodology. Alternatively, GUS 
is fused to a second protein, either as a fusion protein or as a chemical conjugate, that 
30 binds an aglycone. The fusion is incubated with the test substance and an indicator 



Microbial GUSes can also be used in therapeutic methods. By 



The present invention also provides kits comprising microbial GUS 



In a variation of this method, the microbial GUS enzyme is engineered to 
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substrate is added. This procedure may be used for ELISA, Northern, Southern analysis 
and the like. 

The following examples are offered by way of illustration, and not by 
5 way of limitation. 
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EXAMPLES 



EXAMPLE 1 



Identification of Microbes that Express P-Glucuronidase 



5 



Skin microbes are obtained using cotton swabs immersed in 0.1% 



Triton® X-100 and rubbing individual arm pits or by dripping the solution directly into 
arm pits and recovering it with a pipette. Seven individuals are sampled. Dilutions 
(1:100, 1:1000) of arm pit swabs are plated on O.IX and 0.5X TSB (Tiyptone Soy 
10 Broth, Difco) agar containing 50 ^g/mL X-GlcA (5-bromo-4-chloro-3-indolyI p-D- 
glucuronide), an indicator substrate for p-glucuronidase. This substrate gives a blue 
precipitate at the site of enzyme activity (see U.S. Patent No. 5,268,463). TSB is a rich 
medium which promotes growth of a wide range of microorganisms. Plates are 
incubated at ST'^C. 

15 Soil samples (ca. 1 g) are obtained from an area in Canberra, ACT, 

Australia (10 samples) and from Queanbeyan, NSW, Australia (12 samples). Although 
only one of the ten samples from Canberra is intentionally taken from an area of pigeon 
excrement, most isolates displaying P-glucuronidase activity are in the genera 
Enterobacter or Salmonella. Soil samples are shaken in 1-2 mL of water; dilutions of 

20 the supernatant are treated as for skin samples, except that incubation is at 30°C and 
l.OX TSB plates are used rather than diluted TSB. Some bacteria lose vitality if 
maintained on diluted medium, although the use of full-strength TSB usually delays, 
but does not prevent, the onset of indigo build up from X-GIcA hydrolysis. 



25 pattern (halo) surrounding the colony. The appearance of blue colonies varies in time, 
from one to several days. Under these conditions (aerobic atmosphere and rich 
medium) many microorganisms grow. Of these, approximately 0.1-1% display P- 
glucuronidase phenotype, with the secretory phenotj^e being less common than the 
non-secretory phenotype. 

30 Colonies that exhibit a strong, diffuse staining pattern are selected for 

further purification, which consists of two or more streaking of those colonies. 



Microbes that secrete P-glucuronidase have a strong, diffuse staining 
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Occasionally segregation of color production can be observed after the purification 
procedure. In Table 1 below, a sununary of the findings is presented- Some strains are 
listed as GUS secretion-negative because a later repetition of the halo test was negative, 
showing that the phenotype can vary, possibly because of growth conditions. 

5 Phylogenetic analysis 

For phylogenetic identification of the microbes, a variable region of 1 6S 
rDNA is amplified using primers, P3-16SrDNA and 1100r-16SrDNA {see Table 2), 
derived from two conserved regions within stem-loop structures of the rRNA. The 
amplified region corresponds to nucleotides 361 to 705 of E. coli rRNA, including the 

10 primers. Amplification conditions for 16S rDNA are 94°C for 2 min; followed by 35 
cycles of 94°C for 20 sec, 48°C for 40 sec, 72°C for 1.5 min; followed by incubation at 
72°C for 5 min. 

Amplified fragments are separated by electrophoresis on TAE agarose 
gels (approximately 1.2%), excised and extracted by freeze-fracture and phenol 

15 treatment. Fragments are further purified using Qiagen (Clifton Hill, Vic, Australia) 
silica-based membranes in microcentrifuge tubes. Purified DNA Augments are 
sequenced using the amplification primers in combination with BigDye™ Primer Cycle 
Sequencing Kit from Perkin-Elmer ABI (fluorescent dye termal cycling sequencing) 
(Foster City, CA). Cycling conditions for DNA sequence reactions are: 2 min at 94**C, 

20 followed by 30 cycles of 94°C for 30 sec, 50X for 15 sec, and 60°C for 2 min. A \0\xL 
reaction uses 4 of BigDye"'"^ Terminator mix, 1 \xL of 10 primer, and 200- 
500 ng of DNA. The reaction products are precipitated with ethanol or iso-propanol, 
resuspended and subjected to gel separation and nucleotide analysis. 

The ribosomal sequences are aligned and assigned to phylogenetic 

25 placement using the facilities of the Ribosomal Database Project of Michigan State 
University (rdpwww.life.uiuc.edu which now contains more than 10,000 16S rlWA 
sequences (Maidak et al, Nucl. Acids Res. 27:171-173; 1999). Phylogenetic placement 
is used to select strains for further study. 
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Table 1 

STRAIN GUS GUS Genus and Phylogenetic position 

Secretion Amplif tentative species 



SKIN 

Firmicutes / Bacillus-Lactobadllus- 
EH2 yes Staphylococcus wameri Streptococcus SutxJivision 

Firmicutes / Bacillus-La ctobactUus- 



EH4 + 


ves 


Staphylococcus wameri 


Streptococcus Subdivision 








Firmicutes / Bacillus-Lactobacillus- 


EH4-110A 


ves 


Staphylococcus wameri 


Streptococcus Sutxlivtsion 






Sta D h vl ococcu s 


Firmicutes / Bactllus-Laciobacillijs- 


LS-B + 


ves 


haemophilus/hominl 


Streptococcus Sutxliviston 










PG3A + 


no 


Staphylococcus honniniAwameri 


Streotococcus Sutxlivision 










on 1 ~ 


no 


Staphylococcus wameri/aureus 


StreDtococcus Suhdivifiion 










SH1C + 




Staphylococcus wameri/aureus 


Streotococcus Sutxlivtsion 








Firmicutes / Bacillus-Lactobadllus- 


CRA1 + 


no 


Staphylococcus wameri 


Stfeptococcus Subdivision 








Firmicutes / Badllus-Lactotjadllus- 


CRA2 + 


no 


Stephylococcus wameri 


Streptococcus Subdivision 


CANBERRA SOIL 














Proteobacteria - Gamma Subarvlsion - 


CSW1 a 


ves 


Satmonella/Enterobacte r 


Enterics and Relatives 








Proteobacteria - Gamma Sutxjivision - 


CSW1b 


yes 


Salmonella/Enterobacter 


Enterics and Relatives 








Proteobacteria - Gamma Sutxliviston - 


CDS1 + 


no 


Salmonella/Enterobacter 


Enterics and Relatives 








Proteobacteria - Gamma Subdivision - 


CBP1 


yes 


Salmonella/Enterobacter 


Enterics and Relatives 








Proteobacteria • Gamma Subdivision - 


CS2.1 


no 


Salmonella/Enterobacter 


Enterics and Relatives 








Proteobacteria - Gamma Subdivision - 


CS2.3 


no 


Salmonella/Enterobacter 


Enterics and Relatives 


QUEANBEYAN SOIL 














Proteobacteria - Gamma Subdivision - 


Ql,2 


yes 


PseudomonasyAzospirillum 


Pseudomonas and Relatives 








Firmicutes - Actinobacteria - 


Q1.3 + 


no 


Arthrobacter 


Micrococcineae 








Proteobacteria - Gamma Subdivision - 


Q2VD3 


yes 


Pseudomonas/Azospirillum 


Pseudomonas and Relatives 








Firmicutes - Actinobacteria - 


Q2VD6 


yes 


Arthrobacter 


Micrococcineae 








Firmicutes - Actinobacteria - 


Q2VD7 


yes 


Ctavibacterium 


Micrococcineae 








Firmicutes / Bacillus-Lactobacilius- 


Q3WR2 + 


no 


Planococcus 


Streptococcus Sut>division 








Firmicutes - Actinobacteria - 


Q3WR6 + 


yes 


Micrococcus 


Micrococcineae 








Firmicutes - Actinobacteria - 


Q4DS1 


no 


Curtobacterium 


Micrococcineae 








Firmicutes - Actinobacteria - 


QRM1 


no 


Arthrobacter 


Micrococcineae 








Firmicutes - Actinobacteria - 


QRM2 


no 


Arthrobacter 


Micrococcineae 
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Proteobacteria - Gamma Subdivision - 
QRM6 - no Pseud omonas Pseudomonas and Relatives 

Firmicutes - Actinobacteria - 
QTCR3 + no Arthrobacter Micrococci neae 

^ where two genera or species are listed, the rRNA analysis is inconclusive 

As can be observed from the table above, all GUS expressing skin 
isolates belong to the genus Staphylococcus and to a limited number of species. 
Staphylococcus warrteri and Staphylococcus homini or haemophilus. The Canberra soil 
samples all belonged to the genera Salmonella/Enterobacter (bacteria are herein 
referred to in shorthand as Salmonella). These two genera are very similar in the 16S 
rRNA, thus a conclusive identification of the genus requires additional analyses. In 
contrast, a higher degree of microbial diversity was found in the Queanbeyan strains. 
Several bacteria are chosen for further studies. 

The presence of GUS genes is established by amplification using 
degenerate oligonucleotides derived from a conserved region of the GUS gene. A pair 
of oligonucleotides is designed using an alignment of E. coli gusA and human GUS 
sequences. The primer T3-GUS-2F covers E. coli GUS amino acids 163-168 
(DFFNYA), while T7-GUS-5B covers the complementary sequence to amino acids 
' 549-553 (WNFAD). The full length of E. coli GUS is 603 amino acids. As shown in 
Table 1, amplification is not always successful, likely due to mismatching of the 
primers with template. Thus, a negative amplification does not necessarily signify that 
the microorganism lacks a GUS gene. 



EXAMPLE 2 

Cloning of GUS Genes by Genetic Complementation 

Genomic DNA of several candidate strains is isolated and digested with 
one of the following enz>'Tnes, EcoR I, BamH I, ///wd III, Pst\. Digested DNA 
fi-agments are ligated into the corresponding site of plasmid vector pBluescript II SK 
(+), and the ligation mix is electroporated into E. coli KWl, which is a strain deleted 
for the complete GUS operon. Colonies are plated on LB-X-GlcA plates and assayed 
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for blue color. Halo formation is not used as a criterium, because behavior of the GUS 
gene in a different genetic background may alter the phenotype or. detectability. In 
general though, halo formation is obtained in KWl. 

Isolated plasmids from GUS+ transformants are retransformed into KWl 
5 and also into DH5a to demonstrate that the GUS gene is contained Nvithin the construct. 
In all cases, retransformant colonies stained blue with X-GlcA. 

EXAMPLE 3 

10 DNA Sequence analysis of GUS Genes Isolated by Complementation 

DNA sequence is detennined for the isolates that amplified from the 
primers T3 and T7, which flank the pBS polylinker. Cyclic thermal sequencing was 
done as above, except that elongation time is increased to 4 min to allow for longer 
15 sequence determinations. Alternatively, transposon mutagenesis was used to introduce 
sequencing primer sites randomly into the GUS gene (GPS kit: New England Biolabs, 
MA, USA). 

The sequence information is used to design new oligonucleotides to 
obtain the full-length sequence of the clones. 

20 

Table 2 



PRIMER 


BASES 


Tm 


SEQUENCE 


SEQ ID 
No 












GUS-2T 


16 


30.3 


AYT TYT TYA AYT AYG C 


1 

1 1 


GUS-5B 


18 


49 - 5 


GAA RTC IGC RAA RTT CCA 




CSW-RTSHY(F) 


17 


47 . 9 


ATC GCA CGT CCC ACT AC 




CSW-RTSHy (R) 


18 


47 . 9 


CGT GCG ATA GGA GTT AGC 




EH-FRTSHY(F) 


22 


46 .1 


ATT TAG AAC ATC TCA TTA TCC C 




EH-PRTSHY(R) 


23 


47 . 6 


TGA GAT GTT CTA AAT GAA TTA GC 




I»SB-KRPVT<R) 


17 


53 .2 


ATC GTG ACC GGA CGC TT 




CBP-QAYDE 


17 


51.1 


GCG CGT AAT CTT CCT GG 




KG-RPIL 


18 


59 . 7 


TAG C(GA)C CTT CGC TTT CGG 




NG-RPIR 


20 


40. 7 


ATC ATG TTT ACA GAG TAT GG 




Tm-MVRPQRN 


17 


48.4 


ATG GTA AGA CCG CAA CG 




Tm-Nco- 
MVRPQRN 


25 


61-8 


TAA AAA CCA TGG TAA GAC CGC AAC G 
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BASES 


Tm 


SEQUENCE 


SEQ ID 

MO 


Tm-RRLWSE (R) 


20 


47 . 9 


CCT CAC TCC ACA GTC TTC TC 




: Tm-RRIAfSB (R> - 
■Nhe 


30 


67 .4 


AGA CCG CTA GCC TCA CTC CAC AGT CTT 




Ps-FDFFNYAtP) 


22 


47.1 


TTT GAG TTT TTC AAC TAT GCA G 




PS'DFFHYA(R) 


23 


47 .2 


AAT TCT GCA TAG TTG AAA AAG TC 




Salm-TBAQKS (R) 


17 


54.2 


CGC TCT TTT GCG CCT CC 




StS-GOMG(R) 


17 


57 


CCG CCG ATT GCC TGA CC 




P3-16S 


21 


60.8 


GGA ATA TTG CAC AAT GGG CGC 




1100R-16S 


15 


46 


GGG TTG CGC TCG TTG 















DNA sequences are obtained for GUS genes from six different genera: 
Enterobacter/Salmonella, Pseudomonas, Salmonella, Staphylococcus, and Thermotoga 
5 (see, TIGR database at www.tigr.org) (Figures 4A-J and 16). Predicted amino acids 
translations are presented in Figures 3A-B and 17. In addition to the biochemical 
analysis and amplification using GUS primers, confirmation that the isolates contain a 
GUS gene is obtained firom DNA and amino acid sequences. Amino acid aligiunent of 
Bacillus GUS (BGUS) with human (HGUS) and E. call (ECUS) reveal extensive 

JO sequence identity and similarity. Likewise, alignment using ClustalW program of 
Staphylococcus y Staphylococcus homini. Staphylococcus warneri, Thermotoga 
maritima, Enterobacter/Salmonella and E. coll show considerable amino acid identity 
and conservation (Figure 5B). The darker the shading, the higher the conservation 
among all GUSes. As seen in Figures 5B and 1 8, the region containing the critical 

15 catalytic residue (E344 using 5/ap/rK/ococcwj:_numbering) is highly conserved. This 
region extends over amino acids ca. 250 - ca. 360 and ca. 400 - ca. 535. Within these 
regions there are pockets of nearly complete identity. When construcdng variants, in 
general, the regions of highest identity are not altered. 

Two additional sequences from Salmonella and Pseudomonas are 

20 presented in nucleotide aligmnent with Staphylococcus. Significant sequence identity 
among the three sequences indicates that the Salmonella and Pseudomonas sequences 
are p-glucuronidase coding sequences. A full length Salmonella (CBPl) is also aligned 
with £. coU and Staphylococcus GUS. Overall identity is 71% and 51% nucleotide 
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identity to E, coli and Staphylococcus, respectively, and 85% and 46% amino acid 
identity to £. coli and Staphylococcus, respectively. 

5 EXAMPLE 4 

Isolation of a Gene from Staphylococcus and Salmonella Encoding a Secreted 

P-Glucuronidase 

Soil samples and skin samples are placed in broth and plated for growth 
10 of bacterial colonies on agar plates containing 50 fig/mL X-GlcA. Bacteria that secrete 
p-glucuronidase have a strong, diffuse staining pattern surrounding the colony. 

One bacterial colony that exhibited this type of staining pattern is 
chosen. The bacterium is identified as a Staphylococcus based on amplification of 16S 
rRNA, and is most likely in the Staphylococcus pseudomegaterium group. 
15 Oligonucleotide sequences derived from areas exhibiting a high degree of similarity 
between E. coli and himian p-glucuronidases are used in amplification reactions on 
Staphylococcus and E. coli DNA. A fragment is observed using Staphylococcus DNA, 
which is the same size as the E. coli fragment. 

Staphylococcus DNA is digested with Hind III and ligated to Hind III- 
20 digested pBSII-KS plasmid vector. The recombinant plasmid is transfected into KWl, 
an E. coli strain that is deleted for the GUS operon. Cells are plated on X-GIcA plates, 
and one colony exhibited strong, diffuse staining pattern, suggesting that this clone 
encoded a secreted P-glucuronidase enzyme. The plasmid, pRAJal7.L is isolated and 
subjected to analysis. 

25 The DNA sequence of part of the insert of pRAJal 7. 1 is shown in Figure 

1. A schematic of the 6029 bp fragment is shown in Figure 2. The fragment contains 
four large open reading frames. The open reading frame proposed as Staphylococcus 
GUS (GUS^'P) begins at nucleotide 162 and extends to 1907 (Figure 1). The predicted 
translate is shown in Figure 3 A and its alignment with E. coli and human p- 

30 glucuronidase is presented in Figiu-e 5 A. GUS^*** is 47.2% identical to E. coli GUS, 
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which is about the same identity as human GUS and E, coli GUS (49.1%). Thus, GUS 
from Staphylococcus is about as related to another bacterium as to human. One striking 
difference in sequence among the proteins is the number of cysteine residues. Whereas, 
both human and E, coli GUS have 4 and 9 cysteines, respectively, GUS^^ has only one 
cysteine. 

The secreted GUS protein is 602 amino acids long and does not appear 
to have a canonical leader peptide. A prototypic leader sequence has an amino-terminal 
positively charged region, a central hydrophobic region, and a more polar carboxy- 
terminal region {see, von Heijne, J. Membrane Biol, 775:195-201, 1990) and is 
generally about 20 amino acids long. However, in both mammalian and bacterial cells, 
proteins without canonical or identifiable secretory sequences have been found in 
extracellular or peripiasmic spaces. 

A bacterium identified by 165rRNA as Salmonella is isolated on the 
basis of halo formation. The predicted protein is 602 amino acids. There are 7 cysteine 
residues and 1 glycosylation site (Asn-Leu-Ser) at residue 358 (referenced to E. coli 
GUS). The Salmonella and E. coli sequences are very similar (71% nucleotide and 85% 
amino acid identity) reflecting the very close phylogeny of these genera. Salmonella 
GUS is less closely related to Staphylococcus GUS (51% nucleotide and 46% amino 
acid identity). 

To simplify nomenclature, the following is proposed: the p- 
glucuronidase gene is called gusA: To distinguish origins of genes, a superscript is 
used to identify the genus, and species (if known). Thus E. coli GUS gene is gusA*^, 
Staphylococcus GUS gene is gusA^'^, Salmonella GUS gene is gusA^^' and so on. 
Proteins are abbreviated as gus^°, GUS^'^ and so on. 



EXAMPLE 5 
Properties of Secreted P-Glucuronidase 
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Although the screen described above suggests that the Stcq}hylococcus 
GUS is secreted, the cellular localization of GUS^^^ is further examined. Cellular 
fractions {e.g.^ periplasm, spheroplast, supernatant, etc.) are prepared from KWl cells 
transformed with pRAJal 7. 1 or a subfragment that contains the GUS gene and from E. 
5 coll cells that express p-glucuronidase. GUS activity and p-galactosidase (p-gal) 
activity is determined for each fraction. The percent of total activity in the periplasm 
fraction for GUS and p-gal (a non-secreted protein) are calculated; the amount of p-gaJ 
activity is considered backgroimd and thus is subtracted from the amount of p- 
glucuronidase activity. In Figure 6, the relative activities of GUS^*** and E. coli GUS in 

10 the periplasm fraction are plotted. As shown, approximately 50% of GUS^*** activity is 
found in the periplasm, whereas less than 1 0% of E. coli GUS activity is present. 

The thermal stability of GUS^^ and E. coli GUS enzymes are determined 
at 65 °C, using a substrate that can be measured by spectrophotometry, for example- 
One such substrate is p-nitrophenyl p-D-glucuronide (pNPG), which when cleaved by 

15 GUS releases the chromophore p-nitrophenol. At a pH greater than its pKa 
(approximately 7.15), the ionized chromophore absorbs light at 400-420 nm, therefore 
appears in the yellow range of visible light. Briefly, reactions are performed in 50 mM 
Na3P04 pH 7.0, 10 mM 1 mM EDTA, 1 mM pNPG, and 0.1% Triton® X-100 at 

37°C- The reactions are terminated by the addition of 0.4 ml of 2-amino-2- 

20 methylpropanediol, and absorbance measured at 415 nm against a substrate blank. 
Under these conditions, the molar extinction coefficient of p-nitrophenol is assumed to 
be 14,000. One unit is defined as the amount of enzyme that produces 1 nmole of 
product/min at 37°C. 

As shown in Figure 7, GUS^^ has a half-life of approximately 1 6 min, 

25 while E. coli GUS has a half-life of less than 2 min. Thus, GUS^'p is at least 8 times 
more stable than the E, coli GUS. In addition, the catalytic properties of GUS^'^ are 
substantially better than the E. coli enzyme: The Km is approximately one-fourth to 
one-third and the Vmax is about the same at 37°C. 

Table 2 



Staph GUS E, coli GUS 
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Km 


30-40 pNPG 


120 |iM pNPG 


Vmax 


80 nmoles/min/^g 


80 nmoles/min/^g 



The turnover number of GUS^*** is approximately the same as E. coli 
GUS at 37°C and 2.5 to 5 times higher than E. coli GUS at room temperature (Figures 8 
and 9). Turnover number is calculated as imioles of pNPG converted to p>-nitrophenol 
5 per min per ^g of purified protein. 

GUS^ enzyme activity is also resistant to inhibition by detergents. 
Enzyme activity assays are measured in the presence of varying amounts of SDS, 
Triton® X-iOO, or sarcosyl. As presented in Figure 10, GUS^*^ was not inhibited or 
only slightly inhibited ( < 20% inhibition) in Triton® X-100 and Sarcosyl. In SDS, the 

10 enzyme still had substantial activity (60-75% activity). In addition, GUS^ is not 
inhibited by the end product of the reaction. Activity is determined normally or in the 
presence of 1 or 10 mM glucuronic acid. No inhibition is seen at either 1 or 10 mM 
glucuronic acid (Figure 11). The enzyme is also assayed in the presence of organic 
solvents, dimethylformamide (DMF) and dimethylsulfoxide (DMSO), and high 

15 concentrations of NaCl (Figure 12). Only at the highest concentrations of DMF and 
DMSO (20%) does GUS^^ demonstrate inhibition, approximately 40% inhibited. In 
lesser concentrations of organic solvent and in the presence of 1 M NaCl, GUS^*^ retains 
essentially complete activity. 

The Staphylococcus P-glucuronidase is secreted in E. coli when 

20 introduced in an expression plasmid as evidenced by approximately half of the enzyme 
activity being detected in the periplasm. In contrast, less than 10% of E. coli p- 
glucuronidase is found in periplasm. Secreted microbial GUS is also more stable than 
E. coli GUS (Figure 7), has a higher turnover number at both 37°C and room 
temperature (Figures 8 and 9), and unlike E. coli GUS, it is not substantially inhibited 

25 by detergents (Figure 10) or by glucuronic acid (Figure 11) and retains activity in high 
salt conditions and organic solvents (Figure 12). 

As shown herein, multiple mutations at residues Val 128, Leu 141, 
Tyr 204 and Thr 560 (Figures 3A-B) result in a non-functional enzyme. Thus, at least 
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one of these amino acids is critical to maintaining enzyme activity. A mutein 
Staphylococcus GUS containing the amino acid alterations of Val 128 -^AJa, Leu 141 
->His, Tyr 204->Asp and Thr 560— > Ala is constructed and exhibits little enzymatic 
activity. As shown herein, the residue alteration that most directly affected activity is 
5 Leu 141. In addition, three residues have been identified as likely contact residues 
important for catalysis in human GUS (residues Glu 451, GIu 540, and Tyr 504) (Jain et 
ah. Nature Struct. Biol. 3: 375, 1996). Based on alignment v^th Staphylococcus GUS, 
the corresponding residues are Glu 415, Glu 508, and Tyr 471 . By analogy with human 
GUS, Asp 165 may also be close to the reaction center and likely forms a salt bridge 
10 with Arg 566. Thus, in embodiments where it is desirable to retain enzymatic activity 
of micorbial GUS, the residues corresponding to Leu 141, Glu 415, Glu 508, Tyr 471, 
Asp 165, and Arg 566 in Staphylococcus GUS are preferably unaltered. 



15 EXAMPLE 6 

Construction of a Codon Optimized Secreted P-Glucuronidase 

The Staphylococcus GUS gene is codon-optimized for expression in E. 
coll and in rice. Codon frequencies for each codon are determined by back translation 

20 using ecohigh codons for highly expressed genes of enteric bacteria. These ecohigh 
codon usages are available from GCG. The most frequently used codon for each amino 
acid is then chosen for synthesis. In addition, the polyadenyiation signal, AATAAA, 
splice consensus sequences, ATTTA AGGT, and restriction sites that are found in 
polylinkers are eliminated. Other changes may be made to reduce potential secondary 

25 structure. To facilitate cloning in various vectors, four different 5' ends are synthesized: 
the first, called AO (GT CGA C CC ATG G T A GAT CT G ACT AGT CTG TAC CCG) 
uses a sequence comprising an Nco I (underlined), Bgl II (double underlined), and Spe I 
(italicized) sites. The Leu (CTG) codon is at amino acid 2 in Figures 3A-B. The 
second variant, called AI {GTC GAC AGG AGT GCT ATC ATG CTG TAC CCG), 

30 adds the native Shine/Dalgamo sequence 5' of the initiator Met (ATG) codon; the third. 
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called All, {GTC GAC AGG ACT GCT ACCATGGTG TAG CCG) adds a modified 
Shine/Dalgamo sequence 5' of the initiator Met codon such that a Nco I site is added; 
the fourth one, called AIII (GTC GAC AGG AGT GCT A CC ATG G TA GAT CTG 
TAG CCG) adds a modified Shine/Dalgamo sequence 5* of the Leu (CTG) codon 
5 (residue 2) and Nco I and Bgl II sites.. All of these new 5' sequences contain a Sal I site 
at the extreme 5' end to facilitate construction and cloning. In certain embodiments, to 
facilitate protein purification, a sequence comprising a Nhe I, Pml I, and BstE II sites 
(underlined) and encoding hexa-His amino acids joined at the 3' (COOH-terminus) of 
the gene. 

10 GCTAGC CATCACCATCACCAT CACGTG TGAATT GGTGACC G 
SerSerHisHisHisHisHisHisVal * 

Nucleotide and amino acid sequences of one engineered secretable 
microbial GUS are shown in Figures 13A-C, and a schematic is shown in Figure 14. 

15 The coding sequence for this protein is assembled in pieces. The sequence is dissected 
into four fragments, A (bases 1-457); B (bases 458-1012); C (bases 1013-1501); and D 
(bases 1502-1875). Oligonucleotides (Table 4) that are roughly 80 bases (range 36-100 
bases) are synthesized to overlap and create each fragment. The fragments are each 
cloned separately and the DNA sequence verified. Then, the four fragments are excised 

20 and assembled in pLITMUS 39 (New England Biolabs, Beverley, MA), which is a 
small, high copy number cloning plasmid. 

Table 3 



Oligonucleotide 


Size 


Sequence 


SEQ ID 
NO 


gusA^'P A-1-80T 


80 


TCGACCCATGGTAGATCTGACTAGTCTGTACCCGA 
TCAACACCGAGACCCGTGGCGTCTTCGACCTCAAT 
GGCGTCTGGA 




gusA^'^ A- 121 -2008 


80 


GGATTTCCTTGGTCACGCCAATGTCATTGTAACTG 
CTTGGGACGGCCATACTAATAGTGTCGGTCAGCTT 
GCTTTCGTAC 




gusA^'P A-I61-240T 


80 


CCAAGCAGTTACAATGACATTGGCGTGACCAAGGA 
AATCCGCAACCATATCGGATATGTCTGGTACGAAC 
GTGAGTTCAC 




gusA^* A-201-280B 


80 


GCGGAGCACGATACGCTGATCCTTCAGATAGGCCG 
GCACCGTGAACTCACGTTCGTACCAGACATATCCG 
ATATGGTTGC 
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f^l 1 cr A n 1 1 o1 #1 tfl 


Size 




SEQID 
NO 


gusA^^A-241-320T 


80 


GGTGCCGGCCTATCTGAAGGATCAGCGTATCGTGC 
TCCGCTTCGGCTCTGCAACTCACAAAGCAATTGTC 
TATGTCAATG 




gusA^^P A-281-360B 


80 


AATGGCAGGAATCCGCCCTTGTGCTCCACGACCAG 
CTCACCATTGACATAGACAATTGCTTTGTGAGTTG 
CAGAGCCGAA 




gusA^'P A-321-400T 


80 


GTGAGCTGGTCGTGGAGCACAAGGGCGGATTCCTG 
CCATTCGAAGCGGAAATC7VACAACTCGCTGCGTGA 
TGGCATGAAT 




gusA^^P A-361-460B 


100 


GTACAGCCCCACCGGTAGGGTGCTATCGTCGAGGA 
TGTTGTCCACGGCGACGGTGACGCGATTCATGCCA 
TCACGCAGCGAGTTGTTGATTTCCGCTTCG 




gusA^^ A-401-456T 


56 


CGCGTCACCGTCGCCGTGGACAACATCCTCGACGA 
TAGCACCCTACCGGTGGGGCT 




gusA^'P A-41-120B 


80 


CACTTCTCTTCCAGTCCTTTCCCGTAGTCCAGCTT 
GAAGTTCCAGACGCCATTGAGGTCGAAGACGCCAC 
GGGTCTCGGT 




gusA^'P A-6-40B 


35 


TTGATCGGGTACAGACTAGTCAGATCTACCATGGG 




gusA^'P A-81-I60T 


80 


ACTTCAAGCTGGACTACGGGAAAGGACTGGAAGAG 
AAGTGGTACGAAAGCAAGCTGACCGACACTATTAG 
TATGGCCGTC 




gusA^^ B-I-80T 


80 


GTACAGCGAGCGCCACGAAGAGGGCCTCGGAAAAG 
TCATTCGTAACAAGCCGAACTTCGACTTCTTCAAC 
TATGCAGGCC 




gusA^^ B-121-200B 


80 


CTTTGGCTTGAAAGTCCACCGTATAGGTCACAGTC 
CCGGTTGGGCCATTGAAGTCGGTCACAACCGAGAT 
GTCCTCGACG 




gusA^'P B-161-240T 


80 


ACCGGGACTGTGACCTATACGGTGGACTTTCAAGG 
CT^GCCGAGACCGTGAAAGTGTCGGTCGTGGATG 
AGGAAGGCAA 




gusA^'P B-201-280B 


80 


CTCCACGTTACCGCTCAGGCCCTCGGTGCTTGCGA 
CCACTTTGCCTTCCTCATCCACGACCGACACTTTC 
ACGGTCTCGG 




gusA^'P B-241-320T 


80 


AGTGGTCGCAAGCACCGAGGGCCTGAGCGGTAACG 
TGGAGATTCCGAATGTCATCCTCTGGGAACCACTG 
AACACGTATC 




gusA^'P B-281-360B 


80 


GTCAGTCCGTCGTTCACCAGTTCCACTTTGATCTG 
GTAGAGATACGTGTTCAGTGGTTCCCAGAGGATGA 
CATTCGGAAT 




gusA^'P B-321-400T 


80 


TCTACCAGATCAAAGTGGAACTGGTGAACGACGGA 
CTGACCATCGATGTCTATGAAGAGCCGTTCGGCGT 
GCGGACCGTG 




gusA^'P B-361-440B 


80 


ACGGTTTGTTGTTGATGAGGAACTTGCCGTCGTTG 
ACTTCCACGGTCCGCACGCCGAACGGCTCTTCATA 
GACATCGATG 
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SEQID 
NO 


gusA*** B-401-480T 


80 


GAAGTCAACGACGGCAAGTTCCTCATCAACAACAA 
ACCGTTCTACTTCAAGGGCTTTGGCAAACATGAGG 
ACACTCCTAT 




gusA^^ B-41-120B 


80 


TACGTAAACGGGGTCGTGTAGATTTTCACCGGACG 
GTGCAGGCCTGCATAGTTGAAGAAGTCGAAGTTCG 
GCTTGTTACG 




gusA^'P B-441-520B 


80 


ATCCATCACATTGCTCGCTTCGTTAAAGCCACGGC 
CGTTGATAGGAGTGTCCTCATGTTTGCCAAAGCCC 
TTGAAGTAGA 




gusA^' B-481-555T 


75 


CAACGGCCGTGGCTTTAACGAAGCGAGCAATGTGA 
TGGATTTCAATATCCTCAAATGGATCGGCGCCAAC 
AGCTT 




gusA=** B-5-40B 


36 


AATGACTTTTCCGAGGCCCTCTTCGTGGCGCTCGC 
T 




gusA^* B-521.559B 


39 


CCGGAAGCTGTTGGCGCCGATCCATTTGAGGATAT 
TGAA 




gusA^* B-81-160T 


80 


TGCACCGTCCGGTGAAAATCTACACGACCCCGTTT 
ACGTACGTCGAGGACATCTCGGTTGTGACCGACTT 
CAATGGCCCA 




gusA^^^P C-1-80T 


80 


CCGGACCGCACACTATCCGTACTCTGAAGAGTTGA 
TGCGTCTTGCGGATCGCGAGGGTCTGGTCGTGATC 
GACGAGACTC 




gusA^'P C-121-200B 


80 


GTTCACGGAGAACGTCTTGATGGTGCTCAAACGTC 
CGAATCTTCTCCCAGGTACTGACGCGCTCGCTGCC 
TTCGCCGAGT 




gusA^'P C-161-240T 


80 


ATTCGGACGTTTGAGCACCATCAAGACGTTCTCCG 
TGAACTGGTGTCTCGTGACAAGAACCATCCAAGCG 
TCGTGATGTG 




gusA^^ C.201.280B 


80 


CGCGCCCTCTTCCTCAGTCGCCGCCTCGTTGGCGA 
TGCTCCACATCACGACGCTTGGATGGTTCTTGTCA 
CGAGACACCA 




gusA^^'P C-241-320T 


80 


GAGCATCGCCAACGAGGCGGCGACTGAGGAAGAGG 
GCGCGTACGAGTACTTCAAGCCGTTGGTGGAGCTG 
ACCAAGGAAC 




gusA^'P C-281-360B 


80 


ACAAACAGCACGATCGTGACCGGACGCTTCTGTGG 
GTCGAGTTCCTTGGTCAGCTCCACCAACGGCTTGA 
AGTACTCGTA 




gusA^'P C-321-400T 


80 


TCGACCCACAGAAGCGTCCGGTCACGATCGTGCTG 
TTTGTGATGGCTACCCCGGAGACGGACAAAGTCGC 
CGAACTGATT 




gusA^* C-36i-440B 


80 


CGAAGTACCATCCGTTATAGCGATTGAGCGCGATG 
ACGTCAATCAGTTCGGCGACTTTGTCCGTCTCCGG 
GGTAGCCATC 




gusA^'P C-401-489T 


89 


GACGTCATCGCGCTCAATCGCTATAACGGATGGTA 
CTTCGATGGCGGTGATCTCGAAGCGGCCAAAGTCC 
ATCTCCGCCAGGAATTTCA 
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Oligonucleotide 


Size 


Sequence 


SEQID 
NO 


gusA^ C-41-120B 


80 


CCCGTGGTGGCCATGAAGTTGAGGTGCACGCCAAC 
TGCCGGAGTCTCGTCGATCACGACCAGACCCTCGC 
GATCCGCAAG 




gusA'^ C-441-493B 


53 


CGCGTGAAATTCCTGGCGGAGATGGACTTTGGCCG 
CTTCGAGATCACCGCCAT 




gusA^'P C-5-40B 


36 


ACGCATCAACTCTTCAGAGTACGGATAGTGTGCGG 
T 




gusA^^ C-8I-160T 


80 


CGGCAGTTGGCGTGCACCTCAACTTCATGGCCACC 
ACGGGACTCGGCGAAGGCAGCGAGCGCGTCAGTAC 
CTGGGAGAAG 




gusA^^ D-1-80T 


80 


CGCGTGGAACAAGCGTTGCCCAGGAAAGCCGATCA 
TGATCACTGAGTACGGCGCAGACACCGTTGCGGGC 
TTTCACGACA 




gusA^ D-121-200B 


80 


TCGCGAAGTCCGCGAAGTTCCACGCTTGCTCACCC 
ACGAAGTTCTCAAACTCATCGAACACGACGTGGTT 
CGCCTGGTAG 




gusA^'P D-161-240T 


80 


TTCGTGGGTGAGCAAGCGTGGAACTTCGCGGACTT 
CGCGACCTCTCAGGGCGTGATGCGCGTCCAAGGAA 
ACAAGAAGGG 




gusA^^P D-20I-280B 


80 


GTGCGCGGCGAGCTTCGGCTTGCGGTCACGAGTGA 
ACACGCCCTTCTTGTTTCCTTGGACGCGCATCACG 
CCCTGAGAGG 




gusA^'P D-241-320T 


80 


CGTGTTCACTCGTGACCGCAAGCCGAAGCTCGCCG 
CGCACGTCTTTCGCGAGCGCTGGACCAACATTCCA 
GATTTCGGCT 




gusA^^ D-281-369B 


89 


CGGTCACCAATTCACACGTGATGGTGATGGTGATG 
GCTAGCGTTCTTGTAGCCGAAATCTGGAATGTTGG 
TCCAGCGCTCGCGAAAGAC 




gusA^'^ D-321-373T 


53 


ACAAGAACGCTAGCCATCACCATCACCATCACGTG 
TGAATTGGTGACCGGGCC 




gusA^'P D-41-120B 


80 


TACTCGACTTGATATTCCTCGGTGAACATCACTGG 
ATCAATGTCGTGAAAGCCCGCAACGGTGTCTGCGC 
CGTACTCAGT 




gusA'^P D-5-40B 


36 


GATCATGATCGGCTTTCCTGGGCJ^CGCTTGTTCC 
A 




gusA'^ D-8I-I60T 


80 


TTGATCCAGTGATGTTCACCGAGGAATATCAAGTC 
GAGTACTACCAGGCGAACCACGTCGTGTTCGATGA 
GTTTGAGAAC 





The AI form of microbial GUS in pLITMUS 39 is transfected into KWl 
host E. coli cells. Bacterial cells are collected by centrifugation, washed with Mg salt 
solution and resuspended in IMAC buffer (50 mM Na3P04, pH 7.0, 300 mM KCl, 0.1% 
Triton® X-100, 1 mM PMSF). For hexa-His fusion proteins, the lysate is clarified by 
centrifugation at 20,000 rpm for 30 min and batch absorbed on a Ni-IDA-Sepharose 
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column. The matrix is poured into a column and washed with IMAC buffer containing 
75 mM imidazole. The p-glucuronidase protein bound to the matrix is eluted with 
IMAC buffer containing 10 mM EDTA. 

If GUS is cloned without the hexa-His tail, the lysate is centrifuged at 
5 50,000 rpm for 45 min, and diluted with 20 mM NaPO^, I mM EDTA, pH 7.0 (buffer 
A). The diluted supernatant is then loaded onto a SP-Sepharose or equivalent column, 
and a linear gradient of 0 to 30% SP Buffer B (1 M NaCl, 20 mM NaPO^, 1 mM EDTA, 
pH 7.0) in Buffer A with a total of 6 column volumes is applied. Fractions containing 
GUS are combined. Further purifications can be performed. 



EXAMPLE 7 
MUTEINS OF CODON OPTIMIZED P-GLUCURONIDASE 



15 Muteins of the codon-optimized GUS genes are constructed. Each of the 

four GUS genes described above, AO, AJ, All, and AIII, contain none, one, or four 
amino acid alterations. The muteins that contain one alteration have a Leu 141 to His 
codon change. The muteins that contain four alterations have the Leu 141 to His 
change as well as Val 138 to Ala, Tyr 204 to Asp, and Thr 560 to Ala changes. 

20 pLITMUS 39 containing these 12 muteins are transfected into KWl. Colonies are 
tested for secretion of the introduced GUS gene by staining with X-GlcA. A white 
colony indicates undetectable GUS activity, a light blue colony indicates some 
detectable activity, and a dark blue colony indicates a higher level of detectable activity. 
As shown in Table 5 below, when GUS has the four mutations, no GUS activity is 

25 detectable. When GUS has a single Leu 141 to His mutation, three of the four 
constructs exhibit no GUS activity, while the AI construct exhibits a low level of GUS 
activity. All constructs exhibit GUS activity when no mutations are present. Thus, the 
Leu 141 to His mutation dramatically affects the activity of GUS. 



30 



Table 4 
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Number of 
Mutations 


GUS construct 




AO 


AI 


All 


AIII 


4 


white 


white 


white 


white 


1 


white 


light blue 


white 


white 


0 


light blue 


dark blue 


light blue 


light blue 



EXAMPLE 8 
Expression of Microbial P-Glucuronidases 
5 IN Yeast, Plants and E. cou 

A series of expression vector constructs of three different GUS genes, E, 
coll GUS, Staphylococcus GUS, and the AO version of codon-optimized Staphylococctis 
GUS, are prepared and tested for enzymatic activity in E. coli^ yeast, and plants (rice, 

10 Millin variety). The GUS genes are cloned in vectors that either contain a signal 
peptide suitable for the host or do not contain a signal peptide. The E. coli vector 
contains a sequence encoding a pelB signal peptide, the yeast vectors contain a 
sequence encoding either an invertase or Mat alpha signal peptide, and the plant vectors 
contain a sequence encoding either a glycine-rich protein (GRP) or extensin signal 

15 peptide. 

Invertase signal sequence: 

ATGCTTTTGC AAGCCTTCCT TTTCCTTTTG GCTGGTTTTG CAGCCAAAAT ATCTGCAATG (SEQ ID 
NO. ) 

20 Mat alpha signal sequence: 

atgagatttc cttct^tttt tactgcagtt ttattcgcag catcctccgc attagctgct 
ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt 

TACTTAQATT TAGAAGGGGA TTTCGATGTT GCTGTTTTGC CATTTTCCAA CAGCACAAAT 
AACGGGTTAT TGTTTATAAA TACTACTATT GCCAGCATTG CTGCTAAAGA AGAAGGGGTA 
25 TCTTTGGATA AAAGAGAG (SEQ ID NO. ) 

Extensin signal sequence 

CATGGGAAAA ATGGCTTCTC TATTTGCCAC ATTTTTAGTG GTTTTAGTGT CACTTAGCTT 
AGCTTCTGAA AGCTCAGCAA ATTATCAA (SEQ ID NO. _) 



30 



GRP signal sequence 

CATGGCTACT ACTAAGCATT TGGCTCTTGC CATCCTTGTC CTCCTTAGCA TTGGTATGAC 
CACCAGTGCA AGAACCCTCC TA (SEQ ID NO. ) 
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The GUS genes are cloned into each of these vectors using standard 
recombinant techniques of isolation of a GUS-gene containing fragment and ligation 
into an appropriately restricted vector. The recombinant vectors are then transfected 
into the appropriate host and transfectants are tested for GUS activity. 
5 As shown in the Table below, all tested transfectants exhibit GUS 

activity (indicated by a +). Moreover, similar results are obtained regardless of the 
presence or absence of a signal peptide. 



Table 5 



GUS 


E, coli 


Yeast 


Plants 




No SP* 


pelB 


No SP 


inveitase 


Mat a 


No SP 


GRP 


Extensin 


£. CO// GUS 


+ 


NT 


+ 


+ 


+ 




+ 




Staphylococcus 
GUS 




NT 


+ 


+ 


+ 


+ 




-1- 



10 *; SP=signal peptide 

EXAMPLE 9 

Elimination of the Potential N-Glycosylation Site 
15 of Staphylococcus P-Glucuronidase 

The consensus N-glycosylation sequence Asn-X-Ser/Thr is present in 
Staphylococcus GUS at amino acids 118-120, Asn-Asn-Ser (Figures 3A-B). 
Glycosylation could interfere with secretion or activity of P-glucuronidase upon 

20 entering the ER. To remove potential N-glycosylation, the Asn at residue 118 is 
changed to another amino acid in the plasmid pTANE95m (AI) is altered. The GUS in 
this plasmid is a synthetic GUS gene with a completely native 5' end. 

The oligonucleotides Asn-T, 5 -A TTC CTG CCA TTC GAG GCG 
GAA ATC NNG AAC TCG CTG CGT GAT-3' (SEQ ID No. ) and Asn-B, 5'-ATC 

25 ACG CAG CGA GTT CNN GAT TTC CGC CTC GAA TGG CAG GAA T-3' (SEQ 
ID No. ), are used in the "quikchange" mutagenesis method by Stratagene (La 
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Jolla, CA) to randomize the first two nucleotides of the Asn 118 codon, AAC, The 
third base is changed to a G nucleotide, so that reversion to Asn is not possible. In 
theory a total of 13 different amino acids are created at position 118. 

Because expression of GUS from the plasmid pTANE95m (AI) exhibits 
a range of colony phenotypes from white to dark blue, a restriction enzyme digestion 
assay is used to confirm presence of mutants. Therefore, an elimination of a BstB I 
restriction site which does not change any amino acid, is also introduced into the 
mutagenizing oligonucleotides to facilitate restriction digestion screening of mutants. 

Sixty colonies were randomly picked and assayed by BstB I digestion. 
Twenty-one out of the 60 colonies have the BstB I site removed and are thus mutants. 
DNA sequence analysis of these candidate mutants show that a total of 8 different 
amino acids are obtained. Five of the Nl 18 mutants are chosen as suitable for fiirther 
experimentation. In these mutants, the N 1 1 8 residue is changed to a Ser, Arg, Leu, Pro, 
or Met. 

EXAMPLE 10 

Expression of P-Glucuronidase in Transgenic Rice Plants 

Microbial GUS can be used as a non-destructible marker. In this 
example, transgenic rice expressing a GUS gene encoding a secreted form are assayed 
for GUS expression in planta. 

Seeds of TO plants, which are the primary transformed plants, from 
pTANG86. 1/2/3/4/5/6 (see Table 7 below) transformed plants, seeds of pC AM 1301 (£. 
coli GUS with N358-Q change to remove N-glycosylation signal sequence) transformed 
plants, or untransformed Millin rice seeds are germinated in water containing 1 mM 
MUG or 50 (ig/mL X-GlcA with or without hygromycin (for nontransformed plants). 
Resulting plants are observed for any reduced growth due to the presence of MUG, X- 
GlcA. No toxic effects of X-GIcA are detected, but roots of the plants grown in MUG 
are somewhat stunted. 
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For assaying GUS activity in planta, seeds are germinated in water with 
or without hygromycin (for nontransformed plants). Roots of the seedlings are 
submerged in water containing 1 mM MUG, or 50 ^g/mL X-GlcA. Fluorescence (in 
the case of MUG staining) or indigo dye (in the case of X-GIcA staining) are assayed in 
the media and roots over time. 

Secondary roots from seedlings of pTANG86.3 and pTANG86.5 (GUS^'^ 
fused with signal peptides) plants show indigo color after 14 hour incubation in water 
containing X-GlcA. Evidence that GUS is a non-destructive marker is obtained by 
plant growth after transferring the stained plant to water. Furthermore, stained roots 
also grow further. 

EXAMPLE 1 1 
Expression of P-Glucuronidase in Yeast 

All the yeast plasmids are based on the Yep backbone, which contains a 
yeast centromere and is stable at low copy number. Yeast strain InvScl (mat a his'^-Al 
leu2 rrp 1-289 wra3-52) from Invitrogen (Carlsbad, CA) is transformed with the E, coli 
GUS and Staphylococcus GUS plasmids indicated in the table below. Transformants 
are plated on both selection media (minimal media supplemented with His, Leu, Trp. 
and 2% glucose as a carbon source to suppress the expression of the gene driven by the 
gall promoter) and expression media (media supplemented with His, Leu, Trp, 1% 
raffinose, 1% galactose as carbon source and with 50 >ig/ml X-GlcA). 



wo 00/55333 




PCT/USOO/07107 



Table 6 





Yeast 


Plants 




No SP 


Invertase 


Mat alpha 


No SP 


GRP 


Extensin 


E. coll 


pAKD80.3 


PAKD80.6 


pTAlMG87.4 


pTANG86.2 


pTANG86.4 


pTANG86.6 


Syn BGUS 


pTANG87.1 


pTANG87-2 


pTANG87.3 


pTANG86.1 


pTANG86,3 


pTANG86.5 


Nat BGUS 


pAKD 102.1 


pAKE2.I 


pAKEll.4 


pAKD40 


pAKC30.1 


PAKC30.3 



With the exception of pAKD80.6, all other transformed yeast colonies 
are white on X-GlcA plates. The transforraants do express GUS, however, which is 
5 evidenced by lysing the cells on the plates with hot agarose containing X-GIcA and 
observing the characteristic indigo color. The yeast transformants are white when GUS 
is not secreted, as X-GIcA cannot be taken by the yeast cell. All the yeast colonies 
transformed with pAKD80.6 are blue on X-GIcA plates and have a blue halo around 
each colony, clearly indicating that the enzyme is secreted into the medium. 

10 Staphylococcus GUS enzyme has a potential N-glycosylation site, which 

may interfere with the secretion process or cause inactivation of the enzyme upon 
secretion. To determine whether the N-glycosylation site has a deleterious effect, on 
secretion, yeast colonies are streaked on expression plates containing X-GIcA and from 
0. 1 to 20 |ig/ml of tunicamycin (to inhibit all N-giycosylation). At high concentrations 

15 of tunicamycin (5, 10, and 20 ^g/ml), yeast colonies do not grow, likely due to toxicity 
of the drug. However, in yeast transformed with pTANG87.3, the cells that do survive 
at these timicamycin concentrations are blue. This indicates that glycosylation may 
affect the secretion or activity of Staphylococcus GUS. Any effect should be overcome 
by mutating the glycosylation signal sequence as described. 

20 
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EXAMPLE 12 

Expression of Low-Cysteine E. coli P-Glucuronidase 

The E. coli GUS protein has nine cysteine residues, whereas, human 
GUS has four and Staphylococcus GUS has one. Low-cysteine muteins of E. coli GUS 
are constructed to provide a form of £cGUS that is secretable. 

Single and multiple Cys muteins are constructed by site-directed 
mutagenesis techniques. Eight of the nine cysteine residues in E. coli GUS are changed 
to the corresponding residue found in human GUS based on alignment of the two 
protein sequences. One of the E. coli GUS cysteine residues, amino acid 463, aligns 
with a cysteine residue in human GUS and was not altered. The corresponding amino 
acids between E. coli GUS and human GUS are shown below. 



Table 7 



Identifier 


EcGUS Cys residue no. 


Human GUS 
corresponding amino 
acid 


A 


28 


Asn 


B 


133 


Ala 


C 


197 


Ser 


D 


253 


GIu 


E 


262 


Ser 


F 


442 


Phe 


G 


448 


Tyr 


H 


463 


Cys 


I 


527 


Lys 



The mutein GUS genes are cloned into a pBS backbone. The mutations 
are confirmed by diagnostic restriction site changes and by DNA sequence analysis. 
Recombinant vectors are transfected into KWl and GUS activity assayed by staining 
with X-GlcA (5-bromo-4-chIoro-3-indolyl-p-D-gluciu-onide). 

As shown in the Table below, when the Cys residues at 442 (F), 448 (G), 
and 527 (1) are altered, GUS activity is greatly or completely diminished. In contrast. 
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when the N-terminal five Cys residues (A, B, C, D, and E) are altered, GUS activity 
remains detectable. 

Table 8 



Cys changes 


GUS activity 


A 


Yes 


B 


Yes 


C 


Yes 


I 


No 


D,E 


Yes 


F,G 


No 


C, D, E 


Yes 


B, C, D, E 


Yes 


A,B, C, D,E 


Yes 


A, B, C, D, E, I 


No 



From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating fi-om the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 



We claim: 

1. An isolated nucleic acid molecule consisting essentially of a nucleotide 
sequence that encodes a microbial p-glucuronidase, provided that the microbial p- 
glucuronidase is not E. coli p-giucuronidase. 

2. The nucleic acid molecule of claim 1, wherein the microbial p- 
glucuronidase is encoded by a nucleic acid molecule comprising nucleotides 1-1689 of 
Figures 41- J or by a nucleic acid molecule that hybridizes imder stringent conditions to the 
complement of nucleotides 1-1689 of Figure 4I-J and which encodes a functional p- 
glucuronidase. 



3. The nucleic acid molecule of claim I, wherein the microbial p- 
glucuronidase comprises the amino acid sequences of Figure 5B, or a variants thereof, and 
which encodes a functional p-glucuronidase. 

4. The nucleic acid molecule of claim 1, wherein the microbe is a 

eubacteria. 



5. The nucleic acid molecule of claim 4, wherein the eubacteria is 
selected from the group consisting of purple bacteria, gram(+) bacteria, cyanobacteria, 
spirochaetes, green sulphur bacteria, bacteroides and flavobacteria, planctomyces, 
chlamydiae, radioresistant micrococci, and thermotogales. 



6. The nucleic acid molecule of claim 4, wherein the eubacteria is 
selected from the group consisting of Staphylococcus^ Bacillus, Salmonella, Enterobacter, 
Pseudomonas, Arthrobacter, Clavibacter and Thermotoga. 
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7. An isolated nucleic acid molecule encoding a thermostable p- 
glucuronidase, wherein the p-glucuronidase has a half-life of at least 10 min at 65**C. 

8. The nucleic acid molecule of claim 11, wherein the thermostable p- 
glucuronidase is firom Thermotoga or Staphylococcus groups. 

9. An isolated nucleic acid molecule encoding a microbial p- 
glucuronidase, wherein the p-glucuronidase converts at least 50 nmoles of p-nitrophenyl- 
glucuronide to p-nitrophenyl per minute per jig of protein at 37°C. 

10. An isolated nucleic acid molecule encoding a microbial p- 
glucuronidase, wherein the p-glucuronidase retains at least 80% of its activity in 10 mM 
glucuronic acid. 

11. An isolated nucleic acid molecule encoding a fusion protein of a 
microbial p-glucuronidase or an enzymatically active portion thereof and a second protein. 

12. The nucleic acid molecule of claim 1 1, wherein the second protein is 
an antibody or fragment thereof that binds antigen. 

13. An expression vector, comprising a nucleic acid sequence encoding a 
microbial P-glucuronidase in operative linkage with a heterologous promoter, provided that 
the microbial p-glucuronidase is not £, coli p-glucuronidase. 

14. The expression vector of claim 13, wherein the heterologous promoter 
is a promoter selected from the group consisting of a developmental type-specific promoter, a 
tissue type-specific promoter, a cell type-specific promoter and an inducible promoter. 
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15. The expression vector of claim 13, wherein the promoter is functional 
in a ceil selected from the group consisting of a plant cell, a bacterial cell, an animal cell and 
a fungal celL 

16. The expression vector of claim 13, wherein the vector is a binary 
Agrobacterium tumefaciens plasmid vector. 

17. The expression vector of claim 13, further comprising a nucleic acid 
sequence encoding a product of a gene of interest or portion thereof. 

18. The expression vector of claim 1 7, wherein the product is a protein. 

19. The expression vector of claim 13, further comprising a nucleic acid 
sequence encoding a protein that specifically binds a cell, wherein the protein is fused to the 
sequence encoding p-glucuronidase and wherein the vector encodes a fusion protein. 

20. The expression vector of claim 13, wherein the microbial P- 
glucuronidase is encoded by a nucleic acid molecule comprising nucleotides 1-1689 of 
Figures 41- J or by a nucleic acid molecule that hybridizes under stringent conditions to the 
complement of nucleotides 1-1689 of Figure 4I-J and which encodes a functional P- 
glucuronidase. 

21. The expression vector of claim 13, wherein the microbial p- 
glucuronidase comprises the amino acid sequences of Figure 5B, or a variants thereof, and 
which encodes a functional P-glucuronidase. 

22. The expression vector of claim 13, wherein the microbe is a eubacteria. 

23. The expression vector of claim 22, wherein the eubacteria is selected 
fipm the group consisting of purple bacteria, gram(+) bacteria, cyanobacteria, spirochaetes. 
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green sulphur bacteria, bacteroides and flavobacteria, pFanctomyces, chlamydiae, 
radioresistant micrococci, and thermotogales. 

24. The expression vector of claim 22, wherein the eubacteria is selected 
from the group consisting of Staphylococcus, Salmonella, Bacillus, Enterohacter, 
Pseudomonas, Arihrobacter, Clavibacter and Thermotoga. 

25- The expression vector of claim 13, wherein the microbial p- 
glucuronidase is a thermostable p-glucuronidase, wherein the p-glucuronidase has a half-life 
of at least 10 min at 65°C. 



26. The expression vector of claim 25, wherein the thermostable p- 
glucuronidase is from Thermotoga or Staphylococcus groups. 

27. The expression vector of claim 13, wherein the microbial P- 
glucuronidase converts at least 50 nmoles of p-nitrophenyl-glucuronide to p-nitrophenyl per 
minute per |j,g of protein at 37°C- 

28. The expression vector of claim 13, wherein the microbial p- 
glucuronidase retains at least 80% of its activity in 1 0 mM glucuronic acid. 

29. The expression vector of claim 13, wherein the microbial P- 
glucuronidase is an enzymatically active portion thereof. 

30- A host cell containing the vector according to claim 13. 



31. The host cell of claim 30, wherein the host cell is selected from the 
group consisting of a plant cell, an insect cell, a fungal cell, an animal cell and a bacterial cell. 
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32. An isolated form of recombinant microbial ^-glucuronidase, provided 
that the microbial P-glucuronidase is not E. coli P-glucuronidase. 

33. The p-glucuronidase of claim 32, wherein the microbe is a eubacteria. 

34. The P-glucuronidase of claim 33, wherein the eubacteria is selected 
from the group consisting of purple bacteria, gram(+) bacteria, cyanobacteria, spirochaetes, 
green sulphur bacteria, bacteroides and flavobacteria, planctomyces, chlamydiae, 
radioresistant micrococci, and thermotogales. 

35. The p-glucuronidase of claim 33, wherein the eubacteria is selected 
from the group consisting of Staphylococcus group. Salmonella group, Enterobacter group, 
Pseudomonas group, Arthrobacter group, Clavibacter group and Thermotoga group. 

36. The p-glucuronidase of claim 32, wherein the p-glucuronidase is 
encoded by a nucleic acid molecule comprising nucleotides 1-1689 of Figure 41 -J or by a 
nucleic acid molecule that hybridizes under stringent conditions to the complement of 
nucleotides 1-1689 of Figure 4I-J and which encodes a functional P-glucuronidase, 

37. The p-glucuronidase of claim 32. comprising the amino acid sequences 
of Figure 5B, or a variant thereof, £ind which encodes a functional P-glucuronidase. 

38. A method for monitoring expression of a gene of interest or a portion 
thereof in a host cell, comprising: 

(a) introducing into the host cell a vector construct, the vector construct 
comprising a nucleic acid molecule according to claim 1 and a nucleic acid molecule 
encoding a product of the gene of interest or a portion thereof; 

(b) detecting the presence of the microbial p-glucuronidase, thereby 
monitoring expression of the gene of interest. 
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39. A method for transforming a host cell with a gene of interest or portion 
thereof, comprising: 

(a) introducing into the host cell a vector construct, the vector construct 
comprising a nucleic acid sequence encoding a microbial p-glucuronidase, provided that the 
microbial p-glucuronidase is not E, coU p-glucuronidase, and a nucleic acid sequence 
encoding a product of the gene of interest or a portion thereof, such that the vector construct 
integrates into the genome of the host cell; 

(b) detecting the presence of the microbial p-glucuronidase, thereby 
establishing that the host cell is transformed. 

40. A method for positive selection for a transformed cell, comprising: 

(a) introducing into a host cell a vector construct, the vector construct 
comprising nucleic acid sequence encoding a microbial p-glucuronidase, provided that the 
microbial p-glucuronidase is not E. coli P-glucuronidase; 

(b) exposing the host cell to the sample comprising a glucuronide, wherein 
the glucuronide is cleaved by the p-glucuronidase, such that the compound is released, 

. wherein the compound is required for cell growth, 

41. The method of claim 40, further comprising introducing into the host 
cell a vector construct comprising a nucleic acid sequence encoding a microbial glucuronide 
pennease. 

42. The method of any one of claims 38-40, wherein the host cell is 
selected from the group consisting of a plant cell, an animal cell, an insect cell, a fungal cell 
and a bacterial cell. 

43 . A method of producing a transgenic plant that expresses a microbial P- 
glucuronidase, comprising: 

(a) introducing an expression vector comprising a nucleic acid sequence 
encoding a microbial p-glucuronidase in operative linkage with a heterologous promoter. 
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provided that the microbial p-glucuronidase is not E. coli P-glucuronidase, into an 
embryogenic plant cell; and 

(b) producing a plant from the embryogenic plant cell, wherein the plant 
expresses the p-glucuronidase. 

44. The method of claim 43, wherein the transgenic plant is rice. 

45. A method for positive selection for a transformed cell, comprising: 

(a) introducing into a host cell a vector construct, the vector construct 
comprising nucleic acid sequence encoding a microbial p-glucuronidase, provided that the 
microbial P-glucuronidase is not E. coli P-glucuronidase; 

(b) exposing the host cell to the sample comprising a glucuronide, wherein 
the glucuronide is cleaved by the p-glucuronidase, such that the compound is released, 
wherein the compound is required for cell growth 

46. A transgenic plant cell comprising an expression vector, comprising a 
nucleic acid sequence encoding a microbial p-glucuronidase in operative linkage with a 
heterologous promoter, provided that the microbial p-glucuronidase is not E. coli p- 
glucuronidase. 

47. A transgenic plant comprising an expression vector, comprising a 
nucleic acid sequence encoding a microbial p-glucuronidase in operative linkage with a 
heterologous promoter, provided that the microbial p-glucuronidase is not E, coli P- 
glucuronidase. 

48. A seed from the transgenic plant of claim 47. 

49. A transgenic aquatic animal cell comprising an expression vector, 
comprising a nucleic acid sequence encoding a microbial p-glucuronidase in operative 
linkage with a heterologous promoter. 
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50. A transgenic aquatic animal comprising an expression vector, 
comprising a nucleic acid sequence encoding a microbial p-glucuronidase in operative 
linkage with a heterologous promoter. 

51. A method for identifying a microorganism that secretes p- 
glucuronidase, comprising: 

(a) culturing the microorganism in a medium containing a substrate for p- 
glucuronidase, wherein the cleaved substrate is detectable, and wherein the microorganism is 
an isolate of a naturally occurring microorganism or a transgenic microorganism; and 

(b) detecting the cleaved substrate in the medium; 
therefrom identifying an organism that secretes p-glucuronidase. 

52. The method of claim 51, wherein the microorganism is isolated from 
soil, mud, skin, mucus or fecal matter. 

53. The method of claim 51, wherein the microorganism is cultured under 
conditions unfavorable to growth of Staphylococcus and favourable to other microorganisms. 

54. A method for providing an effector compound to a cell in a transgenic 
plant, comprising: 

(a) growing a transgenic plant that comprises an expression vector, 
comprising a nucleic acid sequence encoding a microbial p-glucuronidase in operative 
linkage with a heterologous promoter and a nucleic acid sequence comprising a gene 
encoding a cell surface receptor for an effector compoimd. 

(b) exposing the transgenic plant to a glucuronide, wherein the glucuronide 
is cleaved by the p-glucuronidase, such that the effector compound is released. 
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55. The method of claim 54, further comprising introducing into the 
transgenic plant a vector construct comprising a nucleic acid molecule encoding a 
glucuronide permease. 

56. The method of claim 55, further comprising introducing into the 
transgenic plant a vector construct comprising a nucleic acid sequence that binds the effector 
compound. 

57. The method of claim 56, further comprising a gene of interest in 
operative linkage with the nucleic acid sequence that binds the effector compound. 

58. The method of claim 54, wherein the effector compovmd is 

hydrophobic. 

59. The method of claim 56, wherein the effector compound is either 
ecdysone or a glucocorticoid. 
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FIGURE 1 



1 agcctttact tttctttcaa cttttcatcc 
61 aataatacaa gtcctgattt cgcaagaata 
121 taataacatg taaccactta catttaaaaa 
181 cagaaacccg aggagttttt gatttaaatg 
241 aaggactgga agaaaagtgg tatgaaccaa 
301 cttcctccta taatgatatc ggtgttacga 
361 ggtacgagcg Tigaatttacc gttcctgctt 
421 tcggttcagc aa-cicataag gctatcgcat 
481 aaggcggctt cttciccgttt gaggcagaaa 
541 gtgtaacagt agcggttgat aatattttag 
601 gtgaaagaca tgaagaaggt ttgggaaaag 
661 ttaactatgc aggcttacat cgtcctgtaa 
721 aggatatatc ggctgcaacc gattttaacg 
781 attttcaggg taaggcagaa accgcaaagg 
64 1 ctgcttcaac tgaaggcccc tctggcaatg 
901 ctttaaacac ctatctctat caaactaaag 
961 atgtatacga agagccatcc ggagntcgaa 
1021 Cuaataacaa accattttat tttaaagggt 
1081 gaagaggctt taatgaagca tcaaatgtaa 
1141 cgaattcctt ccggacggcg caccatcctt 
1201 gtgaagggtt agtcgtcata gatgaaaccc 
1261 caacgaccgg tttgggcgaa ggttcagaga 
1321 tcgaacacca tcaagargca ctgagagagc 
1381 ctgtcatgcg gccgaccgca aatgaagcgg 
1441 tuaagccact: agttgaatr.a acgaaagaat 
1501 tcctgtccgt aarggcgaca ccagaaacag 
1561 car.tgaaucg atacaacggc tggcacttitg 
1621 accrtcgtca geaattt^car gcgtggaata 
16 81 cagagtacgg ggctgatacc gtagctggtt 
1741 aagagcatca ggccgaata^ taccaagcaa 
1801 tngrrggcga gcaggcctgg aattttgcag 
1861 ttcaaggtaa caaaaaaggr: gtttccacac 
1921 tcttccgcga acgctggaca aacatcccgg 
1981 grrccccaat aggaggccag ctncctcaca 
2041 cttcattttt tacacaaaaa cgaagagcgt 



cgatactttt ttgtaatagt tttttccatt 
acccttttta gataaaaata tctatgctaa 
ggagtgctat catgttacat ccaatcaaca 
gggtctggaa ttttaaatta gactacggca 
aactgacaga taccatatca atggctgtac 
aggaaattcg aaaccatatc ggctacgtat 
atttaaaaga tcagcgcatc gtcctgcgtt 
acgtcaacgg agaactagta gttgaacaca 
taaacaacag cttaagagac ggaatgaatc 
atgattctac gctcccagtt gggctatata 
tgattcgtaa taaacctaat tttgacttct 
aaatttatac aacccctttt acctatgttg 
gtccaacggg aacagttacg tacacagttg 
ttagtgtagt tgatgaagaa gggaaagttg 
ttgaganccc taacgctanc cttcgggaac 
ttgagttagt aaatgatggt ctaactattg 
ccgttgaagt aaacgacggg aaattcctca 
tcggaaaaca cgaggacacc ccaataaatg 
tggattctaa tatttcgaaa tggatcggrg 
actctgaaga actgacgcgg ctcgcagatc 
cagcagccgg tgtccatttg aactttatgg 
gagtgagtac ttgggaaaaa atccggacct 
tggtctctcg tgacaaaaac cacccccctg 
ctacggaaga agaaggcgct tatgaacac-i 
tagatccaca aaaacgccca gttaccatiug 
ataaagtggc ggagntaatt gatgtgattg 
stgggggtga tcntgaagcc gcgaaagccc 
aacgctgtcc aggaaaacct ataacgataa 
trcacgacat tgatccggtt acgtccacag 
atcatgtagt atctgacgaa tttgagaacr. 
acttcgctac aagccagggt gccacgcgtg 
gcgaccgcaa accaaaatta gcagcacacg 
atttcggtita taaaaattaa taaaaagccg 
tggacacaa" ggctgtaaat caaaaaccct 
tttaatccct caaatigctat cacatrttct 
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FIGURE 2 



Transaldolase 




Staphylococcus GUS gene 
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FIGURE 3A 



A 



Staphylococcus P-glucuronidase 

1 MLYPINTETR GVFDIiNGVWN FKLDYGKGILE EKWYESKLTD TISMAVPSSY 

51 NDIGVTKEIR NHIGYVWYER EFTVPAYLKD QRXVUIFGSA THKAIVYVNG 

101 ELWEHKGGF LPFEAEINNS LRDGMNRVTV AVDNILDDST LPVGLYSERH 

151 EEGLGKVIRN KPNFDFFNYA GLHRPVKIYT TPFTYVEDIS WTDFNGPTG 

201 TVTYTVDFQG KAETVKVSW DEEGKWAST EGLSGNVEIP NVILWEPLNT 

251 YLYQIKVELV NDGLTIDVYE EPFGVRTVEV NDGKFLINNK PFYFKGFGKH 

301 EDTPINGRGF NEASNVMDFN ILKWIGANSF RTAHYPYSEE LMRLADREGL 

351 WIDETPAVG VHLNFMATTG LGEGSERVST WEKIRTFEHH QDVLRELVSR 

401 DKNHPSWMW SIANEAATEE EGAYEYFKPL VELTKELDPQ KRPVTIVLFV 

451 MATPETDKVA ELIDVIALTTR YNGWYFDGGD LEAAKVHLRQ EFHAWNKRCP 

501 GKPIMITEYG ADTVAGFHDI DPVMFTEEYQ VEYYQANHW FDEFENFVGE 

551 QAWNFADFAT SQGVMRVQGN KKGVFTRDRK PKLAAHVFRE RWTNIPDFGY 

601 KN 



Enterobacter/Salmonella fi-glucuronidase 

1 GKLSPTPTAY IQDVTVXTDV LENTEQATVL GNVGADGDIR VELRDGQQQI 

51 VAQGLGATGI FELDNPHLWE PGEGYLYELR VTCEANGECD EYPVRVGIRS 

101 ITXKGEQFLI NHKPFYLTGF GRHEDADFRG KGFDPVLMVH DHALMNWIGA 

151 NSYRTSHYPY AEKMLDWADE HVIWINETA AGGFNTLSLG ITFDAGERPK 

2 01 ELYSEEAING ETSQQAHLQA IKELIARDKN HPSWCWSIA NEPDTRPNGA 
251 REYFAPLAKA TREIDPTRPI TCVNVMFCDA ESDTITDLFD WCLNRYYGW 

3 01 YVQSGDLEKA EQMLEQELLA WQSKLHRPII ITEYGVDTLA GMPSVYPDMW 
3 51 SEKYQWKWLE MYHRVFDRGS VC 



Staphylococcus homini fi-D-glucuronidase 

1 GLSGNVEIPN VILWEPLNTY L.YQIKVELVN DGLTIDVYEE PFGVRTVEVN 

51 DGKFLINNKP FYFKGFGKHE DTPINGRGFN EASNVMDFNI LKWIGANSFR 

101 TAHYPYSEEL MRLADREGLV VIDETPAVGV HLNFiyiATTGL GEGSERVSTW 

151 EKIRTFEHHQ DVLRELVSRD KNHPSWMWS lANEAATEEE GAYEYFKPLG 

201 GAAKELDPXK RPVTIVLFVM ATPETDKVAE LIDVIALNRY NGWYFDGGDL 

251 EAAKVHLRQE FHAWNKRCPG KPIMITEYGA DTVAGFHDID PVMFTEEYQV 

3 01 EYYQANHWF DEFENFVGEQ AWNFADFATS QGVMRVQGNK KGVFTRDRKP 

351 XLAAHVFRER RTNIPDFGYK NASHHH 



B 



C 
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FIGURE 3B 



Staphylococcus wameri fi-D-glucuronidase 

1 LXLIiHPITTG TRGGFALYGX XNIiMIiDYGXG LTDTWTXSLL TELSRLWIiS 

51 WTTHXLTGEX PAISILWPNS ELTVSXLYXG SLXSSSXLCS SLTXHWICQ 

101 XVTLXVDHTG LIXXFEFMST TCCXXDELVT GTLAXILYHX ILPHGLYRKR 

151 HEXGIiGKXNF YXLHFAFFXY AXLXRTVXMY XNLVRXQDIX WTXXHXXXX 

201 TVEQCVXXNX KIXSVKITIL DENDHAIXES EGAKGNVTIQ NPILWQPLHA 

251 YLYNMKVEIiL NDNECVDVYT ERFGIRSVEV KDGQFLINDK PFYFKGFGKH 

301 EDTYXNGRGL NESANVMDIN LMKWIGANSF RTSHYPYSEE MMRIADEQGI 

351 WIDETTXVG IHLNFMXTLG GSXAHDTWXE FDTLEFHKEV IXDLIXRDKN 

401 KAWV^^M^FG NEXGXNKGGA KAXFEPFVNL AGEKDXXXXP VTrVTIIOCAX 

451 RNVCEVXDLV DWCLXXXXG WYXQSGDLEG AKXALDKEXX EWWKXQXNKP 

501 XMFTEYGVDX WGLXXXPDK MXPEEYKMXF YKGYXKIMDK 



Thermotoga maritima fi-glucuronidase 

1 MVRPQRNKKR FILILNGVWN LEVTSKDRPI AVPGSWNEQY QDLCYEEGPF 

51 TYKTTFYVPK XLSQKHIPiY FAAVNTDCEV FUSTGEKVGEN HIEYLPFEVD 

101 VTGKVKSGEN ELRW^yENRL KVGGFPSKVP DSGTHTVGFF GSFPPANFDF 

151 FPYGGIIRPV LIEFTDHARI LDIWVDTSES EPEKKLGIC/K VKIEVSEEAV 

201 GQEMTIKLGE EEKKIRTSNR FVEGEFILEN ARFWSLEDPY LYPLKVELEK 

-251 DEYTIiDIGIR TISWDEKRLY LNGKPVFLKG FGKHEEFPVL GQGTFYPLMI 

3 01 KDFNLLKWIN ANSFRTSHYP YSEEWLDLAD RLGILVIDEA PHVGITRYHY 

351 NPETQKIAED NIRRMIDRHK NHPSVIMWSV A^IEPESNHPD AEGFFKALYE 

401 TANEMDRTRP WMVSMMDAP DERTRDVALK YFDIVCVNRY YGWYIYQGRI 

451 EEGLQALEKD lEELYARHRK PIFVTEFGAD AIAGIHYDPP QMFSEEYQAE 

501 LVEKTIRLLL KKDYIIGTHV WAFADFKTPQ NVRRPILNHK GVFTRDRQPK 

551 LVAHVLRRLW SEV 
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FIGURE 4A 

Staphylococcus ^-glucuronidase 

MetLeuTyrProIleAsnThrGluThrArgGlyValPheAspLeuAsnGl 
1 ATGTTATATCCyU^TCAATACAGAAACCCGAGGAGTTTTTGATTTAAAT^ 

yValTrpAsnPheLysLeuAspTyrGlyLysGlyLeuGluGluLysTrpT 
5 1 GGTCTOGAATTTTAAATTAGATTACGGCAAAGGACTGGAAGAAAAGT^ 

yrGluSerLysLeuThrAspThrlleSerMetAlaValProSerSerTyr 
101 ATGAATCAAAACTGACAGATACCATATCAATGGCTGTACCTTCCTCCTAT 

AsnAspIleGlyValThrLysGluIleArgAsnHisIleGlyTyrValTr 
151 AATGATATCGGTGTTACGAAGGAAATTCGAAACCATATCGGCTATGTATG 

pTyrGluArgGluPheThrValProAlaTyrLeuLysAspGlnArglleV 
201 GTACGAGCGTGAATTTACCGTTCCTGCTTATTTAAAAGATCAGCGCATCG 

alLeuArgPheGlySerAlaThrHisLysAlalleValTyrValAsnGly 
251 TCCTXSCGTTTTGGTTCAGCAACACATAAGGCTATTGTATACGTTAACC^ 

GluLeuValValGluHisLysGlyGlyPheLeuProPheGluAlaGluIl 
301 GAACTAGTAGTTGAACACAAAGGCGGCTTCTTACCGTTTGAGGCAGAAAT 

eAsnAsnSerLe\xArgAspGlyMetAsnArgValThrValAlaValAspA 
351 AAACAACAGGTTAAGAGACGGAATGAATCGTGTAACAGTAGCGGTTGATA 

snileLeuAspAspSerThrLeuProValGlyLeuTyrSerGluArgHis 
401 ATATTTTAGATGATTCTACGCTCCCAGTTGGGCTATATAGTGAAAGACAT 

GluGluGlyLeuGlyLys VallleArgAsnLys ProAsnPheAspPhePh 
451 GAAGAAGGTTTGGGAAAAGTGATTCGTAATAAACCTAATTTTGACTTCTT 

eAsnTyrAlaGlyLeuHisArgProValLysIleTyrThrThrProPheT 
501 TAACTATGCAGGCTTACATCGTCCTGTAAAAATTTATACAACCCCTTTTA 

hrTyrValGliiAspIleSerValValThrAspPheAsnGlyProThrGly 
551 CCTATGTTGAGGATATATCGGTTGTAACCGATTTTAACGGTCCAACGGGA 

ThrValThrTyrThrValAspPHeGlnGlyLysAlaGluThrValLysVa 
601 ACAGTTACGTATACAGTTGATTTTCAGGGTAAGGCAGAAACCGTAAAGGT 

ISerValValAspGluGluGlyLysValValAlaSerThrGluGlyLeuS 
651 TAGTCTAGTTGATGAAGAAGGGAAAGTTGTTGCTTCAACTGAAGGCCTCT 
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FIGURE 4B 
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erGlyAsnValGluIleProAsnVallleLeuTrpGluProLeuAsnThr 
CTGGTAATGTTGAGATTCCTAACGTTATCCTTTGGGAACC^ 

TyrLeuTyrGlnlleLysValGluLeuValAsnAspGlyLeuThrlleAs 
. TATCTCTATCAAATTAAAGTTGAGTTAGTAAATGATGGTCTAACTATTGA 

pValTyrGluGluProPheGlyValArgThrValGluValAsnAspGlyL 
TGTATACGAAGAGCCATTTGGAGTTCGAACCGTTGAAGTAAACGACGGGA 

ysPheLeuIleAsnAsnLysProPheTyrPheLysGlyPheGlyLysHis 
AATTCCTCATTAATAACAAACCATTTTATTTTAAAGGGTTCGG^^^ 

GluAspThrProIleAsnGlyArgGlyPheAsnGluAlaSerAsnValMe 
GAGGATACTCCAATAAATGGAAGAGGCTTTAATGAAGCATCAAATGTAAT 

tAspPheAsnl leLeuLy sTrpI leGlyAlaAsnSerPheArgThrAlaH 
GGATTTTAATATTTTGAAATGGATCGGTGCGAATTCCrn 

IsTyrProTyrSerGluGluLeiiMetArgLeuAlaAspArgGluGlyLeu 
ACTATCCTTATTCTGAAGAACTGATGCGGCTCGCAGATCGTGAAGGGTTA 

ValVallleAspGluThrProAlaValGlyValHisLeuAsriPheMetAl 
GTCGTCATAGATGAAACCCCAGCAGTTGGTGTTCATTTGAACTTTATOT 

aThrThrGlyLeuGlyGluGlySerGluArgValSerThrTrpGluLysI , . 
AACGACTGGTTTGGGCGAAGGTTCAGAGAGAGTGAGTACTTGGGAAAAAA 

leArgThrPheGluHisHisGlnAspValLeuArgGluLeuValSerArg 
TCCGGACCTTTGAACATCATCAA.GATGTACTGAGAGAGCTGGTTTCTCGT 

AspLysAsnHisProSerValValMetTrpSerlleAlaAsnGliiAlaAl 
GATAAAAACCACCCCTCTGTTGTCATGTGGTCGATTGCAAATGAAGCGGC 

aTlirGluGluGluGlyAlaTyrGluTyrPheLysProLeuValGluLeuT 
TACGGAAGAAGAAGGCGCTTATGAATACTTTAAGCCATTAGTTGAATTAA 

hrLysGluLeuAspProGlnLysArgProValThrlleValLeuPheVal 
CGAAAGAATTAGATCCACAAAAACGCCCAGTTACCATTGTTTTGTTCGTA 

MetAlaThrProGluThrAspLysValAlaGluLeuIleAspVallleAl 
ATGGCGACACCAGAAACAGATAAAGTGGCGGAGTTAATTGATGTGATTGC 

aLeuAsnArgTyrAsnGlyTrpTyrPheAspGlyGlyAspLeuGluAlaA 
ATTGAATCGATACAACGGCTGGTATTTTGATGGGGGTGATCTTGAAGCCG 



wo 00/55333 



# 



7 / 41 



PCT/USOO/07107 



FIGURE 4C 



1451 
1501 
1551 
1601 
1651 
1701 
1751 



laLysValHisLeuArgGlnGluPheHisAlaTrpAsnliysArgCysPro 
CGAAAGTCGACCTTCGTCAGGAATTTCATGTOTGGAATAAACGCTGTCCA 

GlyLysProIleMetlleThrGluTyrGlyAlaAspThrValAlaGlyPh 
GGAAAACCTATAATGATAACAGAGTATGGGGCTGATACCGTAGCTGGTTT 

eHisAspIleAspProValMetPheThrGluGluTyrGlnValGluTyrT 
TCATGATATTGATCCGGTTATGTTTACAGAAGAGTATCAGGTTGAATATT 

yrGlnAlaAsnHisValValPheAspGluPheGluAsnPheValGlyGlu 
ACCAAGCJU^TCATGTAGTATTTGATGAATTTGAGAACTTO 

GlnAlaTrpAsnPheAlaAspPheAlaThrSerGlnGlyValMetArgVa 
CAGGCCTGGAATTTTGCAGACTTTGCTACaAGCCAGGGTGTCATGCGTC 

IGlnGlyAsnLysLysGlyValPheThrArgAspArgLysProLysLeuA 
TCAAGGTAACAAAAAAGGTGTTTTCACACGCGACCGCAAACCAAAATTAG 

laAlaHisValPheArgGluArgTrpThrAsnlleProAspPheGlyTyr 
CAGCACATGTTTTCCGCGAACGTTGGACAAACATCCCGGATTTCGGT^ 



1801 



LysAsn 
AAAAAT 
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FIGURE 4D 
Enterobacter/SalmoneUa fi-glucuronidase gene 

CATTGGGGAAACTTTCCCCCACACCrrACTGCGTAmTTCAGGATGTTACG 5 0 
GTOmrACTGATGTTTTGGAAAATACTGAACA^ 100 
ATGTGGGGGCTGATGGTGATATTCGGGTTGAGCTTCGCGATGGGCAGCAA 15 0 
CAAAlAGTGGCACAAGGGCrGGGGGCCACAGGTATATTTGAACT 200 
TCCTCATCTTTGGGAACCT^TGAAGGGTATTTGTACG^ 250 
CCTGCGAAGCCAATGGTGAGTGTGACGAATATCCAGTACGTGTCGGTATC 300 
CGTTCCATXACGGOTAAGGGTGAGCAGTTTTTGATTAACCAC^^ 350 
TTATTTAACCCGGTTTTGGTCGACATGAAGATGCAGATrTTCGCGGCAAA 400 
GGTTTCGACCCGGGTGTTGATGGTTCACGACCACGCGTTGATGAACTGGA 45 0 
TTGGGCTAACTCCrrATCGCACGTCCCACTACCCTTACGCGQAAAAGATGC 500 
TCGATTGGGCTGATGAGCACGTATCGTAGTGATTAATGAAACCGCGGCGG 550 
GTGGCTTTAACACTTTATCGTTGGGAATCACTCT 600 
CCTAAAGAACTTCTACAGCGAAGAGGCGATTAATGGCGAGACTTCAGCAG 650 
GCTCACTTGCAGGCTATAAAAGAGCITATTGCCCGGGATAAAAACm 70 0 

AAGTGTAGTGTGTGGAGTATTGCCAATGAGCCCGACACCCGTCCAAATGG 750 
AGCGAGAGAGTACTTTGCGCCTTTAGCTAAGGCCACTCGTGAACTGGATC 800 
CGACACGTCCGAaTACCTGCGTAAACGTGATGTTCTGCaA.TGCCGAAAGC 850 
GACACCATCACCGACCTGTTCGACGTGGTTTGTCTGAATCGCTATTACGG 900 
CTGGTATGTGCAATCAGGTGATTTGGAAAAAGCAGAACAGATGCTGGAGC 950 
AAGAACTGCTGGCCrGGCAGTCAAAACTACATCGCCCAATTATTATTAra 1000 
GAATACGGTGTCGATACGCTGGCAGGAATGCCCTCGGTTTATCCCGACAT 1050 
GTGGAGTGAAAAGTACCAGTGAAATGGCTTGAAATGTATCACCGTGTCTT 1100 
TGACCGGGGGAGCGTTTGCAAGCGCNAAGCTTAGTTAACACCGGNGGTAC 1150 
CGATCACGCGTTtAGGCGCCNCCCATGGNCATATGNGCTAGCNTGCGGCCG 1200 
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FIGURE 4E 

CNATGCATTCTGCAGCGATCGCAGCTGAGTACACGAGCrCACCC^ 1250 

TCGACAAGATCCAAGTACTACCCGGGICVTACGTAACTAGTGCATGCTCGC 1300 

GAAAIATTTAGGCCTTATCGAATTAAT 1328 

Pseudomonas fi-D-glucuronidase 

CTTGCTGGAOTACNGTTMAGGATTTTTAGA 5 0 

TGACCNAACTATCACGCCGGNCGTGCANGCrTGGACCGCGACATTNCCTC 100 

ACT^GNaAAANACTCCGCCATATCCATCITTGCTGGCCCAAa^ 150 

lIACNGTNNCGNACinm^NGANGGATCAGTGNATCGAGCTCC^ 200 

CrmCGCTAACATAACATGTNGCATATGTCAATllAATl^CGCTGOT 250 

ANCNCACCGGGCTNATTCGmXSNNATTCGAATTGNATGNC^ 300 

NTGCACGNTGGNAAANAATTGCGTNACAGGGACrrTTGGCCNCTTCCT 350 

CCATNGCATCCTCCCNATGGGCTGTACACGAATGNGCCCCCAAA;^ 400 

TTCAGAAAGGCAATTTOTAACAAGGCNGANNTTTGACT:^^ 450 

CAGNNCTGCACCGGACGCTGAAAATGTACANGACCCTGGGTACGTNCNAC 500 

CAAGACATI^AAGTNGTGACCGACTCCATTGTNCTAACCGGGACTGTACC 550 

TATAATGCGGACTATCANGGCAATGCATGACGTNGAANCGACACACCAGG 600 

ATNAGGAAAACAAOTGGTGGNANCNCACCANGCCATGATTGTCACGTTT^ 650 

GTTAGCNTNGANACNAATTa^TTGCTTTNTTAGCTl^^ 700 

NTTTANATTAGAtOTCTNANTGAGACTGT 73 0 

Salmonella ^-glucuronidase 

NCTCATGACCCNCCCNTTTraGTANamsriTTGNNANCTGCT 5 0 

TCACNAOmGGAimCGGGGNGGGTTCGI^CTCTATGGCNCGNGGAAC^^ 100 

ATGtraSGNCNAOTGTTOANGACTGACMACACG^ 150 



wo 00/55333 



10 / 41 



PCT/USOO/07107 



FIGURE 4F 



GCCGAACTATCACTCAGlSrrCNTGNAAGTTGGACT^CAC^ 200 

GNGAAAAGCCCGCCATATCCATACTGTGCTGGCCCAACANTGAGTTC^ 250 

GTCGTCGI^CTim^TGANGGATCACCTGTATCGANCTCCa^T^ 300 

NCAGCTAACATAACTGTGNGCATATGTCAATGNATGACCTG^ 350 

NCACACCGGGCGraATTGOTGNl^TTCGAATTTb^ 400 
TGCANGNTGGAATGAATCTGGGGGCCAGGGACTTTGGCCANCTTCCTNAA 450 

CCATTCGCANCCTCCCCCAGTGGGCTTGTACACNATTGNGCCCCAAAAAG 500 

GCOTCAGATAGGCATTTTGACAAGCTCCANiriTAAC^^ 550 

NGNCCTGCACCGGACGCTGAAAAANGTACANGANCCTTGTACGTTCCACC 600 

AAGANATTTAAGGTGTGACCCa.Cm'CCATTTTCCrrAA^ 650 

NATAAAGGNTGACCOTTC7\NGGACACATTGCAATGACCCT^ 700 

ANAACCCCCGGITITAAAGGAAAAACAAATTTGGTTGGGWAGTCCAJICC^ 75 0 

GGGCCAATTAimrGTTNCNCGGGGGAISrrAAANCCCCasrCC^ 800 

CGAAATTTAAACAGCGCTCCGGCCGCCACGTGCGAATTCCGATATCGGAT 850 

GAGGCCAGCGCNAAGCTTAGTTAACACCGGNGGTACCGATCACGCGTNAG 90 0 

GCGCCWCCCATGGNCATATGNGCTAGCOTGCGGCCGCNATGCATTCTGCA 95 0 

GCGATCGCAGCTGAGTACACGAGCTCACCCGCGGAGTCGACAAGATCCAA 1000 

GTACTACCCGGGNATACGTAACTAGTGCATGCTCGCGAAATATTTAGGCC 1050 

TTATCGAATTAA 1063 

Staphylococcus wameri fi-giucuronidase 

TANANCITGTNTCrrGCTGCACCCNATCACGACAGGGACCCG^ 5 0 

CGCGCTCTATGGCNCGNGGAACTTAATGCTGGACTACGGTTNAGGACTGA 100 

CAGACACGTGGACTNAAAGCTTGCTGACCGAACTATCACGACTGGTCGTG 15 0 

CTAAGTTGGACCACACATTNCCTGACAGGGGAAANACCCGCCATATCC^ 200 
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FIGURE 4G 



CTTGTGGCCCAACAGTGAGTTAACCGTGTCGANCITATATGANGGATC^ 250 
TGNATTCGAGCTCGOTCTTATGTTCTTCGCTAAC^ 300 
TGTCAATANGTGACNCTGGNCGTGGATCACACCGGGCTNATTGW^^ 350 
CGAATTTATGTCAACAACTTGTTGCT^GNTGGATGAAT^^ 400 
CTTTGGCCT^CATCCTATACCATNGCATCCTTCCCCATGGGCT^ 450 
AAGCGCCACGAAAANGGCCTCGGAAAAGNCAATrTTTACNGGCTCCAC^ 500 
TGC Nri ' 1 ' l ' l 'CAAOTATGCTGANCTGNACCGGACGGTNANAATGTA 550 
ACCTTGTACGTO^CAAGACATTTAGGTTGTGACCGNT^^ 600 
TNOTAAACAGTAGAACAATGTGTGANCCOTAACTAAAAAATANAC^ 650 
TAAAATCACGATTCTGGATGAAAATGATCATGCAATANCCGAAAGCGAAG 700 
GCGCTAAAGGCAATGTAACTATTCAAAATCCrATATTGTGGCAACCTTTA 75 0 
CATGCCTATTTATACAATATGAAAGTAGAATTACTCAACGATAATGAGTG 800 
TGTAGATGTTTATACAGAACGTTTCGGTATTCGATCTGTNGAAGTGAAGG 850 
ATGGACAGTTTTTAATTAATGACMACCATTTT 900 
AAACATGAAGATACCTATTAAAATGGTCGAGGCrTAAACGAATCAGCC^ 950 
CGTCATGGACATCAACirAATGAAATGGATAGGTGCTAATTCATTTAGAA 1000 
CCTCrCATTACCCATATTCAGAAGAAATGATGCGTTTAGCAGATGAAC^ 1050 
GGTATTGTAGTGATAGATGAGACAACANGTGTCGGTATACATCITAAT^ 1100 
TATGGNNACCTTAGGTGGCTCCNTTGCACATGATACATGGAANGAAT^ 1150 
ACACrCTCGAGTTTCATAAAGAAGTCATANAAGACITGATTGNGAG^ 1200 
AAGAATCATGCATGGGTAGTCATGTGGTNATTTGGCAATGAGCNAG 1250 
AAATAAAGGGGGTGCTAAAGCATNCTTTGAGCCATTTGTTAATT^^ 1300 
GTGAAAAAGATNOTCaSTGNimJGCCCAGTGACrATCGTTACT 1350 
GCNNANCGAAATGTATGTGAAGTTI^NAGATTTAGTCGATGTGG^^ 1400 
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FIGURE 4H 



NmNAGNNNISrr3^GGTTGGTATNCAC3\AT^^ 1450 

AACKAGCATTAGATAAGGAGiraVCaTCGAATGGTGGAAANGACT^ 1500 

AAGCCAATNATGTTTACAGAGTATGGTGTGGATANNGTTGTAGG 1550 

NNa3ATNCCTGATAAAATGCNNCCA(3AAGAGTATAAAATGA£^^ 1600 

AAGGtTTATNATAAAATTATGGAlAAACGATCGCAGCTGAGlACACGAGCT 1650 

CACCCGCGGAGTCGACAAGATCCAAGXACTACCCGGGNATACGTAACTAG 1700 

TGCATGCTCGCGAAATATTTAGGCCTTATCGAATTAAT 1739 

Staphylococcus homini ^-glucuronidase gene 

TGTGGGNCTTTGTTCCTTGISrrCAGCrCCCCAACGGCI^ 5 0 

CGCGCCCTCTTCCTCAGTCGCCGCCTCGTTGGCGATGCTCCACATCACGA 100 

CGCTTCGATGGTTCTTGTCACGAGACACCAGTTCACGGAGAACGTCCT 150 

TGGTGCTCAAACGTCCGAATCTTCTCCCAGGTACTGACGCGCTCGCTGCC 200 

TTCGCCGAGTCCCGTGGTGGCCATGAAGTTGAGGTGCACGCCAACTGCCG 250 

GAGTCTCGTCGATCACGACCAGACCCTCGCGATCCGCAAGACGCATCAAC 300 

TCTTCAGAGTACGGATAGTGTGCGGTCCGGAAGCTGTTGGCGCCGATCCA 350 

TTTGAGGATATTGA?^TCCATCACATTGCTCGCTTCGTTAAAGCCACGGC 400 

CGTTGATAGGAGTGTCCTCATGTTTGCCAAAGCCCTTGAAGTAGAACGGT 450 

TTGTTGTTGATGAGGAACTTGCCGTCGTTGACTTCACGGTCCGCACGCCG 500 

AACGGCTCTTCATAGACATCGATGGTCAAGTCCCGTCGTTmCCAGTTCC 550 

ACTTTGATCTGGTAGAGATACGTGTTCAAGTGGTTCCCAGAGGATGAC^ 600 

TCGGAATCTTCACGTTACCGCTCAAGCC 62 9 
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Thermotoga maridma fi-glucuronidase 

ATGGTAAGACaSCAACGAAAaU^GAAGAGATTTATTC^^ 5 0 

AGTTTGGAATCTTGAAGTAACCAGCAAAGACAGACCAATCG^ 10 0 

GAAGCTGGAATGAGCAGTACCAGGATCrGTGCTACGAAGAAGGACCCTrC 15 0 
ACCTACAAAACCACCITCTACGTTCCGAAGNAACITTC^ 200 
CAGACITOACTTTGCTGCGGTGAACACGGACTGCGA^ 250 
GAGAGAAAGTGGGAGAGAATCACATTGAATACCTTCCCTTCGAAGTAGAT 300 
GTGA.a3GGGAAAGTGAAATCCGGAGAGAACGAACTCAGGGTGGTTGT^ 350 
GAACAGATTGAAAGTGGGAGGATTTCCCTCGAAGGTTCCAGAC^ 400 
CrCACACCGTGGGATTTTTTGGAAGTTTTCCACCTGCT^ 450 
TTCCCCTACGGTGGAATCTITAAGGCCTGTTCTGATAGAGTTCACAGACCA 500 
CGCGAGGATACT CGACATCTGGGTQGACACGAGTGAGTCTGAACCGGAGA 550 
AGAAACTTGGAAAAGTGAAAGTGAAGATAGAAGTCTCAGAAGAAGCGGTG 600 
GGACAGGAGATGACGATCAAACTTGGAGAGGAAGAGAAAAAGATTAGAAC 65 0 
ATCCAACAGATTCGTCGAAGGGGAGTTCATCCTCGAAAACGCCAGG'rrCT 700 
GGAGCCTCGAAGATCCATATCTTTATCCrCTCAAGGTGGAACTTG?^AAA^ 75 0 
GAOSAGTACACrCTGGACATCGGAATCAGAACGATCAGCTGGGACGAG^ 800 
GAGGCTCTATCTGAACGGGAAACCTGTCTTTTTGAAGGGCTTTGGAAAGC 850 
ACGAGGAATTCCCCGTTCTGGGGCAGGGCACCTTTTATCCATTGATC^ 900 
AAAGACTTCAACCTTCTGAAGTGGATCAACGCGAA'iTL"l"lTCAGGACCTC 95 0 
TCACIATCCnTACAGTGAAGAGTGGCTGGATCTTGCCGACAGACTCGGAA 1000 
TCCTTGTGATAGACGAAGCCCCGCACGTTGGTATCACAAGGTACCACTAC 1050 
AATCCCGAGACTCAGAAGATAGCAGAAGACAACATAAG7AGAATGATCGA 1100 
CAGACACAAGAACCATCCCAGTGTGATCATGTGGAGTGTGGCGAACGAAC 1150 
CAGAGTCCAACCATCCAGACGCGGAGGGTTTCTrCAAAGCCCTTTA 1200 
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FIGURE 4J 



ACTGCO^TGAAATGGATCGAACACGCCCCGTTGTCATGGTGAGCAT^ 1250 

GGACGCACCAGACGAGAGAACAAGAGACGTGGCGCTGJ^TACTTCGAC^ 1300 

TCGTCTGTGTGAACAGGTACTAOSGCraGTA^ 1350 

GAAGAAGGACTTCAAGCTCTGGAAAAAGACATAGAAGAGCT^ 1400 

GCACAGAAAGCCCATCTITCTCACAGAATTCGGTGCGGACGCGATM 1450 

GCATCCACTACGATCCACCTCaAATGrrCrCCGAAGAGTACCAAC^^ 1500 

CI^TTGAAAAGACGATCAGGCTCCTTTTGAAAAAAGACT^ 1550 

AACACACGTGTGGGCCTTTGCAGArrri'AAGACTCCTC2«LAATC 1600 

GACCCATTCTCAACCACAAGGGTGTTTTCACAAGAGACAGAC^ 1650 

CTCGTTGCTCATGTACTGAGAAGACTGTGGAGTGAGGTT 1689 
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MtiYP INTBTRGVFDLNGVWNFKLDYG KGLEEKWYESKLTDT ISMAVP 4 7 

LGLQGG^ILYPQESPSRECKEIlDGLWSFRADFSDNRRRGFEEQWYRRPLWESGPTVDMPVP 6 0 
MliRPVETPTREIKKLDGLWAFSIiDREN C6IDQRWWESALQESR AIAVP 4 8 



SSYNDIGVTKEIRNHIGYVWYEREFTVPAYLKD QRIVLRFGSATHKAIVYVNGEIiW 104 

SSFNDI SQDWRLRHFVGWVWyERBVILPERWTQDLRTRVVLRIGSAHSYAIVWVNGVDTL 12 0 
GSFNDQFADADIRNYAGNVWYQREVFIPKGWAG QRIVLRFDAVTHYGKVWVNNQEVM 105 



EHKGGFLPFEAE INNSLRDG MNRVTVAVDNILDDSTLPVG- LYSERHEEGLGKVIR 15 9 

EHEGGYLPFEADISNLVQVGPLPSRLRITIAINNTLTPTTLPPGTIQYLTDTSKYPKGYF 180 
EHQGGYTPFEADVTPYVIAG KSVRITVCVNNELNWQTIPPG- -MVITDENGKKK 157 



- NKPNFDFFNYAGLHRPVKIY 'ITPFT yVEDI SWTDFNGPT- - GTVTYTVDFQG - KZ^TV 215 
VQNTYFDFFNYAGLQRSVLLYTTPTTYIDDITVTTSVEQDS - - GLVNYQISVKGSNLFKL 23 8 

- QSYFHDFFNYAGIHRSVMLYTTPNTWVDD ITWTHVAQDCNHAS VDWQWANG DV 212 



KVSWDEEGKWASTEGLSGNVEIPNVILWEP LNTYLYQIKVELVNDGLT ID 267 

EVRLLDAENKVVANGTGTCKSQLKVPGVSLWWP YLMHERPAYLYSLEVQLTAQTSIiGPVSD 298 
SVEIJRDADQQWATGQGTSGTLQWNPHLWQP GEGYLYELCVTAKSQTEC D 263 



VYEEPFGVRTVEVNDGKFL INNKPFYFKGFGKHEDTPINGRGFNEASNVMDFNILKWIGA 327 
FYTIiPVG IRTVAVTKSQFL INGKPFYFHGVNKHEDAD IRGKGFDWPLLVKDFNLLRWLGA 358 
I YPLRVGIRSVAVKGEQFLINHKPFYFTGFGRHEDADLRGKGFDNVIJ^IVHDHAIjMDWIGA 323 



NSFRTAHYP YSEEIJWIRIJUDREGLWIDETPAVGVHLNFNIATTGLGEGSERVSTWEKIR - - 3 85 

NAFRTSHYPYAEEVMQMCDRYGI WIDECPGVGLAL P QFFNNV 4 01 

NSYRTSHYPYAEEMLDWADEHGIWIDETAAVGFNLSLGIGFEAGNKPKELYSEEAVNGE 383 



TFEHHQDVLRELVSRDKNHPSWMWSIANEAATEEEGAYEYFKPLVELTKELDPQKRPVT 445 
SIJIHHMQVMEEVVRRDKNHPAVVMWS VANEPASHLESAGYYLKMVIAHTKSLD - RPVT 4 60 
TQOAHIiQAIKELIARDKNHPSWMWSIANEPDTRPQGAREYFAPLAEATRKLDPT-RPIT 442 



IVLFVMATPETDKVAELIDVIALNRYNGWYFDGGDLEAAKVHLRQEFHAWNKRCPGKPIM 5 05 
FVS- -NSNYAADKGAPYVDVICLNSYYSWYHDYGHLELIQLQLATQFENWYKKYQ-KPII 517 
CVNVMFC15AHTDTI SDLFDVLCLNRYYGWYVQSGDLETAEKVLEKELLAWQEKIiH - QP 1 1 501 



ITE YGADTVAGFTO I DPVMFTEE YQVEYYQANHWFD - - EFENFVGEQAWNFADFATSQG 5 63 
QSEYGAETIAGFHQDPPIiMFTEEYQKSLI^QYHIXSIXtQKRRKYVVGELIWNFADFMTEQS 577 
ITEYGVDTLAGLHSMYTDMWSEE YQCAWLDMYHRVFD - - RVSAWGEQVWNFADFATSQG 559 



VMRVQGNKKGVFTRDRKPKLAAHVFRERWTNIPDFGYKN 602 

PTRVLGNKKG I FTRQRQPKSAAFLLRERYWK IAN- ET 613 

ILRVGGNKKGIFTRDRKPKSAAFIiLQKRWTGMNFGEKPQQGGKQ 603 
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Stapbylococciu ■ 

SCaph honuL : 

St;aph_*iram 
T he rmoCog a 
Knb/SaXmon 
S coli 



MVD LT S EYTTI ^n-CTRCVT-DiiNCrvW^ - KG LEEKWYE S KI.TOT 1 5MAVP S S Y 

LXLLHPlTTGTRGGFAZiVGXXNLMUDYG-XGLT DTWTXSLLTaLSRLVVI.SWT 

MVR^QRNKKRFILHaJGVWNl-EVTSK D- RP IA.VP<;SW 

MliRPVETPTREI KJCLDGLWAFSLDRENCGl OORVVESAl^ESRAIAyPGSF 
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Staphylococcus . 
Staph_homi- : 
St:aph_wacn: 
Thecmo^oga: 
Bnb/SaXmon : 
E coXi. : 



NDlGVTKElMJHTGY^A.^V^:REFTyPJVYLKDQR — I Vl^RrGSATHKAIVYVNGELW i 109 

THX-LTGKX-PAISILWPNSELTVSXLYXGSl-XSSSXLCSSLTXHVVXCQXVTLXV : 106 

NEQ--YQbl/rYEEGPFTYKTTrYVPKXL.SQKH 1 RLYFAAVNTDCEVFLNGEaCVG i 8 8 

NDQFADADIRNYAGNVWYQKEVTIPKGWAGOR 1 VLJLFDAVTHYGKVWVNNQEVM : 10 5 



Suphylococcus; 
S t:aph_horai : 
Scaph_wa rn : 
ThermoCoga: 
E nb/ S a Xmon'i 



StHKGGFLPraAEI N-NS LROGMNRVTVAVDN 1 liDDST LPVG LYS ERHEEGIX5KVI R 

jnHTGI.IXXFEFMSTTCCXXDEI.VTGTIAX- - 1 CYHXI LPHGLYRKRHEXGLGKXNF 
roOJI EVlJSirV'D^rrGKVKS GENELRVVVEN - RliXVGGFKS KVP DSGTHT VGFFG S F 

iSHQGGYTP raADVT P YVI AGKSVRI TVCVKKE CHVQT I PP GMVI T DENG KKK 
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Staphylococcus , 
St»ph_homi : 
St:aph_^rfarnt 
Th« ciaocoqa : 
Rnb/SaXmon : 
E coli. : 



NKPNFOTTFN-tAGianiP^ncrYTTPFT-YyE^ TGTVTYTVDFQGKA 



VXLHFAFEKYAXiaCRTVXMYX -N LVTOC' 
PPANFOrgPYGGriRPVLrEFTDHARr 

GKX;SPT'PTAYT 

QS YFH DKFNYAG IHRS VMUYTTPNTV^y I 




-HX XX-TVEQCVXXN- 

ESEP EKKX*GKVKVKI EVSEEA 

DVLEN T EQATVI-G^JCVGAOG 

AQ D CNHASVDWCWANG 
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Staphylococcus . fgErKNrVGEQAWFADFArSQOWRVQCNKKGVFTRIlRKPKlJWIV^ P 
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St;aph_warn: h{5iC 
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Pseud omona: £ r 
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FIGURE ISA 



MetValAspLeuThrSerLeuTyx 
ATACGACTCA CTAGTGG GTC GACCCATGG T AGATCT GACTAGTCTGTAC 



Pro 1 1 eAs nThr GluThr Ar gG ly Va 1 Phe AspLeuAs nG ly Va 1 TrpAs n 
CCGATCAACACCGAGACCCGTGGCGTCTTCGACCTCAATGGCGTCTGGAAC 

PheLysLeuAspTyrGlyLysGlyLeuGluGluLysTrpTyrGluSerLys 
TTCAAGCTGGACTACGGGAAAGGACTGGAAGAGAAGTGGTACGAAAGCAA 

LeuThrAspThrlleSerMetAlaValProSerSerTyrAsnAspIle 
GCTGACCGACACTATTAGTATGGCCGTCCCAAGCAGTTACT^TGACATrG 

G lyVa IThr Lys G lu 1 1 eArgAsnH i s 1 1 eG lyTy rVa ITrpTyrG IxiArg 
GCGTGACCAAGGAAATCCGCAACCATATCGGATATGTCTGGTACGAACGT 

GluPheThrValProAlaTyrLeuLysAspGlnArglleValLeuArgPhe 
GAGTTCACGG TGCCGGCCTATCTGAAGGATCAGCGTATCGTGCTCCGCTT 

G lySer Al aThr H i sLy sAl all eVa iTyrVa 1 As nG lyG luLeuVa 1 
CGGCTCTGCAACTCACAAAGCAATTGTCTATGTCAATGGTGAGCTGGTCG 

ValGluHisLysGlyGlyPheLeuProPheGluAlaGluIleAsnAsnSer 
TGGAGCACAAGGGCGGATTCCTGCCATTCGAAGCGGAAATCAACAACTCG 

LeuArgAspG lyMe t AsnAr gVa iThrVa 1 Al a Va 1 AspAs nl 1 eLeuAsp 
CTGCGTGATGGCATGAATCGCGTCACCGTCGCCGTGGACAACATCCTCGA 

As pS er Thr Leu Pr oVa IG lyLeuTyr S er G 1 uAr gH isGluGluGly 
CGATAGCACCCTCCCGGTGGGGCTGTACAGCGAGCGCCACGAAGAGGGCC 

LeuGlyLys Va 1X1 eArgAsnLys Pr oAs nPheAspPhe PheAsnTyr Al a 
TCGGAAAAGTCATTCGTAACAAGCCGAACTTCGACTTCTTCAACTATGCA 

GlyLeuHisArgProValLysIleTyrThrThrProPheThrTyrValGlu 
GGCCTGCACCGTCCGGTGAAAATCTACACGACCCCGTTTACGTACGTCGA 

AspIleSerValValThrAspPheAsnGlyProThrGlyThrValThr 
GGACATCTCGGTTGTGACCGACTTCAATGGCCCAACCGGGACTGTGACCT 

TyrThrValAspPheGlnGlyLysAlaGluThrValLysValSerValVal 
ATACGGTGGACTTTCAAGGCAAAGCCGAGACCGTGAAAGTGTCGGTCGTG 

AspGluGluGlyLysValValAlaSerThrGluGlyLeuSerGlyAsnVal 
GATGAGGAAGGCAAAGTGGTCGCAAGCACCGAGGGCCTGAGCGGTAACGT 

GluIleProAsnVallleLeuTrpGluProLeuAsnThrTyrLeuTyr 
GGAGATTCCGAATGTCATCCTCTGGGAACCACTGAACACGTATCTCTACC 
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Ncol 



Bglll 
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FIGURE 13B 



G Inl 1 eLysVa 1 GluLeuVa lAsnAspGlyLeuThr 1 1 eAspValTyxGlu 
CAGATCAAAGTGGAACTGGTGAACGACGGACTGACCATCGATGTCTAT(^^ 

G luPr oPheG lyVal ArgThrVa IG 1 uVa lAs nAspG lyLys PheLeu 1 1 e 
GAGCCGTTCGGCGTGCGGACCGTGGAAGTCAACGACGGCAAGTTCCTCAT 

AsnAsnLysProPheTyrPheLysGlyPheGlyLysHisGluAspThr 

c?^caac::aaaccgttctacttcaagggctttggcaaac^ 
ProIleAsnGlyArgGlyPheAsnGluAlaSerAsnValMetAspPheAsn 

ctatcaacggccgtggctttaacgaagcgagcaatgtgatggatttcaat 

I leLeuLysTrpI leGlyAlaAsnSerPheArgThrAlaHisTyr ProTyr 

atcctcaaatggatcggcgccaacagcttccggaccgcacactatccgta 

Se rG 1 uG 1 uLeuMe t Ar gLeuAl aAs pArgG luG lyLeu Va 1 Va 1 1 1 e 
ctctgaagagttgatgcgtcttgcggatcgcgagggtctggtcgtgatcg 

AspGluThrProAlaValGlyValHisLeuAsnPheMetAlaThrThrGly 

acgagactccggcagttggcgtgcacctcaacttcatggccaccacggga 

LeuGlyGluGlySerGluArgValSerThrTrpGluLysIleArgThrPhe 

ctcggcgaaggcagcgagcgcgtcagtacctgggagaagattcggacgtt 

GluHisHisGlnAspValLeuArgGluLeuValSerArgAspLysAsn 

tgagcaccatcaagacgttctccgtgaactggtgtctcgtgacaagaacc 

HisProSerValValMetTrpSerlleAlaAsnGluAlaAlaThrGluGlu 

atccaagcgtcgtgatgtggagcatcgccaacgaggcggcgactgaggm 

GluGlyAlaTyrGluTyrPheLys ProLeuValGluLeuThrLysGluLeu 

gagggcgcgtacgagtacttcaagccgttggtggagctgaccaaggaact 

AspProGlnLysArgProValThrlleValLeuPheValMetAlaThr 

cgacccacagaagcgtccggtcacgatcgtgctgtttgtgatggctaccc 

ProGluThrAspLysValAlaGluLeuIleAspVallleAlaLeuAsnArg 

cggagacggacaaagtcgccgaactgattgacgtcatcgcgctcaatcgc 

TyrAs nG lyTrpTyr PheAspG lyG lyAs pLeuGluAl aAl aLy s Va IH i s 

tataacggatggtacttcgatggcggtgatctcgaagcggccaaagtcca 

LeuArgGlnGluPheHisAlaTrpAsnLysArgCysProGlyLysPro 

tctccgccaggaatttcacgcgtggaacaagcgttgcccaggaaagccga 

IleMetlleThrGluTyrGlyAlaAspThrValAlaGlyPheHisAspIle 

tcatgatcactgagtacggcgcagacaccgttgcgggctttcacgacatt 



Asp ProVa iMe t PheThrG 1 uG luTyrG 1 nVa IG 1 uTyrTyrG 1 nAl aAs n 

gatccagtgatgttcaccgaggaatatcaagtcgagtactaccaggcgaa 
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FIGURE 13C 



HisValValPheAspGluPheGluAsnPheValGlyGluGlnAlaTrp 
CCACGTCGTGTTCGATGMtTTGAGAACTTCGTGGGTGAGCT^CGTGGA 

AsnPheAl aAspPheAl aThr S erGl nG lyValMe t ArgVa 1 G 1 nG lyAsn 
ACTTCGCGGACTTCGCGACCTCTCAGGGCGTGATGCGCGTCCAAGGAAAC 

LysLysGlyVal PheThrArgAspArgLys ProLysLeuAl aAl aHi s Val 
AAGAAGGGCGTGTTCACTCGTGACCGCAAGCCGAAGCTCGCCGCGCACGT 

PheArgGluArgTrpThr Asm le ProAspPheG lyTyrLysAs n 
CTTTCGCGAGCGCTGGACCAACATTCCAGATTTCGGCTACAAGAAC GCTA 

SerHisHisHisHisHisHisVal * 
GCCATCACCATCACCA TCACGTG TGAAT TGGTGACC G 
Nhel Pmll BstEII 
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FIGURE 14 
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Ncol (S) 
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FIGURE 16 
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ATGTTACGTT CTGTCGAAAC CGCGACGCGA GAAATCAAAA AACTGGACGG 
CCTGTGGTCG TTTTGTATGG ATAGCGAAGA GTGCGGCAAC GCGCAGCAAT 
GGTGGCGTCA ACCGTTACCC CAAAGCCGCG CTATCGCCGT TCCGGGAAGC 
TATAACGATC AGTTTGCCGC TGCCGAGATC CGCAATTATG TTGGCAACGT 
CTGGTATCAG CGTGAGATAC GCATCCCGAA AGGCTGGGAT CGCCAGCGCA 
TAGTGCTGCG CTTTGATGCG GTGACTCACT ATGGAAAAGT TTGGGTCAAT 
GACCAATTTT TAATGGAACA TCAGGGCGGC TACACGCCGT TTGAAGCGGA 
TATCAGCCAC CTTATCTCCG CCGGGGAATC CGTGCGTATC ACGGTATGCG 
TGAATAACGA GCTGAACTGG CAGACGATCC CGCCGGGCGT TGTGACCCAG 
GGCGTAAACG GTAAGAAGCA GCAAGCGTAT TTCCATGATT TCTTTAACTA 
CGCCGGTATT CATCGCAGCG TAATGCTGTA CACCACGCCG AAAACTTTTG 
TGGAAGATAT TACCGTCGTG ACGCAGGTTG CTGACGATCT GGCTCAGGCT 
ACCGTCGCCT GGCAGGTACG GGCGAATGGC GAAGTGCGTG TAGAGCTACG 
TGACGCGGAG CAACAGCTTG TCGCTTCGGG GCAAGGGGAA AAAGGTGAAC 
TGCTGCTGGA AGGGCCGCGG CTGTGGCAGC CTGGCGAGGG CTATCTTTAT 
GAACTGCGGG TCATCGCGCA GCATCAGGAC GAGCAGGATG AATATCCGCT 
GCGCGTCGGT ATTCGCTCGG TAGAAGTAAA AGGGGAGCAG TTCCTGATCA 
ACCATAAGCC TTTCTATTTC ACCGGGTTCG GACGTCATGA AGATGCCGAT 
CTGCGCGGTA AGGGTTTTGA TAACGTGCTG ATGGTGCACG ACCACGCGCT 
AATGGACTGG ATCGGTGCGA ACTCTTACCG TACCTCGCAT TACCCTTATG 
CCGAAGAGAT GCTCGACTGG GCGGACGAAC ATGGCATCGT CATCATTGAT 
GAAACGGCCG CCGTCGGATT CAACCTGTCT TTAGGGATTA GCTTTGATGT 
CGGCGAAAAA CCCAAAGAGC TCTACAGCGA TGAGGCCGTG AACGATGAAA 
CGCAGCGCGC GCACCTGCAG GCAATTAAGG AGCTGATTGC CCGCGATAAG 
AACCACCCAA GCGTCGTGAT GTGGAGTATC GCCAACGAAC CGGATACCCG 
CCCGAACGGC GCGCGCGAAT ACTTCGCTCC GCTGGCGCAG GCAACGCGCG 
AACTCGATCC TACACGTCCG ATAACCTGCG TGAACGTGAT GTTCTGCGAT 
GCGGAAAGCG ACACCATTAC CGATCTCTTT GATGTCGTTT GCCTGAACCG 
CTACTACGGC TGGTATGTAC AAAGCGGCGA TCTGGAGAAG GCTGAGAAAG 
TGCTGGAGAA AGAGCTTCTG GCCTGGCAGG AGAAACTCCA CCGCCCGATT 
ATCATCACCG AATACGGCGT CGATACGCTT GCAGGCCTGC ATTCCATGTA 
CAACGATATG TGGAGCGAAG AGTACCAGTG CGCCTGGCTT GATATGTACC 
ATCGCGTGTT TGATCGCGTC AGCGCCGTCG TCGGCGAGCA GGTATGGAAC 
TTCGCCGACT TCGCCACTTC GCAGGGCATT ATGCGCGTTG GCGGCAACAA 
AAAAGGTATA TTCACCCGCG ACAGAAAACC AAAATCGGCG GCCTTCCTGC 
TGCAAAAACG CTGGACCGGC ATGGACTTTG GCGTGAAGCC CCAGCAGGGA 
GATAAATAAT GA 



wo 00/55333 



31 / 41 



PCTAJSOO/07107 



FIGURE 17 
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MLRSVETATR EIKKLDGLWS FCMDSEECGN AQQWWRQPLP QSRAIAVPGS 
YNDQFAAAEI RNYVGNVWYQ REIRIPKGWD RQRIVLRFDA VTHYGKVWVN 
DQFLMEHQGG YTPFEADISH LI SAGES VRI TVCVNNELNW OTIPPGWTQ 
GVNGKKQQAY FHDFFNYAGI HRSVMLYTTP KTFVEDITW TQVADDLAQA 
TVAWQVRANG EVRVELRDAE QQLVASGQGE KGELLLEGPR LWQPGEGYLY 
ELRVIAQHQD EQDEYPLRVG IRSVEVKGEQ FLINHKPFYF TGFGRHEDAD 
LRGKGFDNVL MVHDHALMDW IGANSYRTSH YPYAEEMLDW ADEHGIVIID 
ETAAVGFNLS LGISFDVGEK PKELYSDEAV NDETQRAHLQ AIKELIARDK 
NHPSWMWSI ANEPDTRPNG AREYFAPLAQ ATRELDPTRP ITCVNVMFCD 
AESDTITDLF DWCLNRYYG WYVQSGDLEK AEKVLEKELL AWQEKLHRPI 
IITEYGVDTL AGLHSMYNDM WSEEYQCAWL DMYHRVFDRV SAWGEQVWN 
FADFATSQGI MRVGGKKKGI FTRDRKPKSA AFLLQKRWTG MDFGVKPQQG 
DK 
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FIG. 18A 
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FIG. 18B 
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FIG. 18C 
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FIG. 19A 




FIG. 19B 
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FIG. 19C 
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FIG. 19D 



wo 00/55333 



Staph 

E.coli 

Sal 



Staph 
E . coli 
Sal 



Staph 

E.coli 

Sal 



Staph 
E , coli 
Sal 



Staph 

E.coli 

Sal 



Staph 

E.coli 

Sal 



Staph 

E.coli 

Sal 



Staph 

E.coli 

Sal 




PCT/USOO/07107 



1169 
1170 
1145 



1202 
1206 
1181 



1238 
1242 
1217 



1274 
1278 
1253 



1310 
1311 
1289 



1346 
1344 
1322 



1382 
1380 
1358 



1418 
1416 
1394 



FIG. 19E 
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FIG. 19F 
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